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Balancing Balance 

BASALDELLA ET AL., PAGE 301 

The vestibular and proprioceptive systems control body balance by targeting 
motor neuron populations regulating functionally distinct muscles and fiber 
types, and their inputs are shaped by competitive interactions between these 
two systems. 



Sniffing Out Specificity 

FU ETAL., PAGE 313 

A technique for linking specific chemical compounds within mixtures from 
bodily fluids to functionally distinct classes of sensory neurons sets the stage 
for assigning discrete functional identities to individual vomeronasal neurons 
as shown for a new family of murine female pheromones. 



Interferon with Neurodegeneration 

EJLERSKOV ET AL., PAGE 324 

Lack of the immunomodulatory cytokine interferon-p causes spontaneous neurodegeneration resembling sporadic Lewy 
body and Parkinson’s disease dementia due to defects in neuronal autophagy. 



Fat Chance for Survivai 

BAILEY ET AL., PAGE 340 

Lipid droplets can act as anti-oxidant organelles to protect Drosophila neural stem cells from hypoxia-triggered ROS by 
sequestering damage-vulnerable polyunsaturated fatty acids from plasma membranes. 



Slithering Buiids an Organ 

KUO ET AL., PAGE 394 

A form of migration named “slithering,” where cells undergo transient EMT, allows rapid assembly of pulmonary neurosensory 
organs and may explain why lung cancers arising from neuroendocrine cells are highly metastatic. 



Purposefui Protein Aggregation 

BERCHOWITZ ET AL., PAGE 406 

Amyloid-like aggregation of the RNA-binding protein Rim4 controls gameto- 
genesis by repressing message-specific translation. 



A Lasting Scar of Acute Infection 

DA FONSECA ET AL., PAGE 354 

Following a single episode of acute gastrointestinal infection, microbiota- 
dependent inflammation of fat tissue and damage to the lymphatic system 
disrupt the communication between the immune system and the tissue, 
causing an ’’immunological scar” that compromises tissue immunity and 
homeostasis in the long term. 




Cell 163, October 8, 2015 ©2015 Elsevier Inc. 261 




Cell 




Stick to Your Gut 

SANO ET AL., PAGE 381 
ATARASHI ET AL., PAGE 367 

The adhesion of specific members of the gut microbiome to epithelial cells in 
the small intestine initiates a signaling axis involving CD4+ T cells and serum 
amyloid A to generate mucosally protective Th17 cells, thereby highlighting 
the importance of location in the regulation of gut homeostasis. 



Protease Lego 

JOSHI ET AL., PAGE 419 

Sequential protein degradation driving the Caulobacter cell cycle relies on an 
adaptor hierarchy— priming of the protease by one adaptor recruits additional 
adaptors, and substrate degradation occurs based on the degree of adaptor 
assembly. 



Snapping into Shape 

TOMKO ET AL., PAGE 432 

The base and lid complexes of the functional proteasome can only associate once a single alpha helix from the last-to-be 
incorporated subunit of the lid complex snaps into place and triggers a large-scale conformational change. 



siRNAs One By One 

ZHAI ET AL., PAGE 445 

Rather than a single RNA transcript giving rise to multiple siRNAs, Pol IV transcribes multiple short RNAs that are further pro- 
cessed to generate mature products, thereby demonstrating a “one precursor, one product” model for Pol IV-dependent 
siRNA biogenesis in Arabidopsis. 



A Brainy Network 

MARKRAM ET AL., PAGE 456 

A large-scale digital reconstruction and simulation of neocortical microcircuitry suggests cellular and synaptic mechanisms 
that dynamically reconfigure the state of the network to support diverse strategies of information processing. 



Stratifying Breast Cancer 

CIRIELLO ET AL, PAGE 506 

A comprehensive analysis of breast tumor samples determines invasive 
lobular carcinoma as a molecularly distinct disease with characteristic genetic 
features, providing key information for patient stratification that may allow a 
more informed clinical follow-up. 



Nanoscaie Neurocircuitry in 3D 

SIGAL ET AL., PAGE 493 

Super-resolution imaging with multi-color reconstruction of synapses in brain 
tissue enables interrogation of neural circuitry at nanoscale. 
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A Spotless Mind 



Memories are difficult to pin down, in part because there is no 
single kind —they can be fragmentary or interwoven, stable or 
ephemeral, easily recalled or long dormant. Exploring how 
learning and memory are encoded by changes in neuronal 
structure is a pursuit that supplants the abstract with some- 
thing tangible, a toehold for understanding one of the most 
fundamental of experiences. 

Seeking out where a specific memory is made and system- 
atically erasing it by manipulating ensembles of synaptic 
structures is the great achievement of recent work by Haya- 
shi-Takagi et al. (201 5). They cleverly create a probe that con- 
tains a photoactivatable form of the small GTPase Rad (which 
when active induces spine shrinkage) and link it to a deletion 
mutant of PSD-95, which, together with a dendritic targeting 
element (DTE), endows the probe with the property of accu- 
mulating at activated synapses. With this combo of features, 
the probe targets recently potentiated spines and upon photo- 
stimulation will induce spine shrinkage. Using it, the authors 
are able to see the extensive synaptic remodeling that occurs 
in the motor cortex as mice learn a task involving balance and 
motor coordination. When the probe is activated following 
training, the mastery gained by the mice is lost, whereas 
targeting spines activated by a different motor learning task 
does not impact learning of the original one, despite the spatial 
overlap of the assemblies of activated synapses. 

The authors estimate that the erasure of the acquired motor 
memory entails removal of ~41 0,000 spines from ~4,700 
neurons, providing a glimpse of the complexity of neuronal 
structural changes responsible for task-specific learning. 
This raises interesting questions. How are these neurons 
linked in circuits? And how equivalent are the contributions 
of individual spines to the acquired memory? Ultimately, it 
will also be interesting to learn how this scale of structural 
change compares to that which might be formed in other 
brain regions by exposure to objects and events. Might it 
be possible to accelerate learning with photoactivatable 
probes designed to augment the formation of these synaptic 
ensembles? Or perhaps to engineer proteins that won’t 
require photostimulation for activation, but might instead 
be triggered remotely? Studies have achieved remote control 
of channel proteins by radio waves using genetically en- 
coded nanoparticles (Stanley et al., 2012; Stanley et al., 
2015). Such an approach, if possible, could target neurons 
more spatially separated than the ensembles studied by 
Hayashi-Takagi et al. Might it even be possible to make 
mice with a reset switch for specific episodic memories? 

Controlling behavior in mice with optogenetic tools is being 
deployed with ever greater precision and can now be used to 
intervene in behaviors once thought experimentally intrac- 
table, a recent example being the reactivation of neurons 
linked to a positive experience as a means of alleviating 
depression-like symptoms (Ramirez et al., 2015). Such 
interventions, however, are not yet possible in humans or 
primates, though steps in this direction are being made, 
including a recent study by Inoue et al. (2015) exploring 
optogenetic stimulation in macaque monkeys. Inoue et al. 
introduced channeirhodopsin via an adeno-associated virus 




Visualizing neuronal spine potentiation. Image courtesy of A. Haya- 
shi-Takagi. 



vector to the frontal eye field, a region of the primate frontal 
cortex that is part of the oculomotor system. The optogenetic 
stimulation of channeirhodopsin-expressing axons that 
project from this area to the superior colliculus triggers rapid 
involuntary eye movements known as saccades, demon- 
strating the feasibility of pathway-selective optogenetic 
stimulation in primates. Casting our glance to the scientific 
horizon, it seems that such studies bode well for translating 
the panoply of tools used in rodents to studies of behavior 
in a wider range of models, particularly those amenable to 
genetic modification (including macaques, Niu et al., 2014) 
or one day even to therapeutic applications in humans. 
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Pied Piper of Neuroscience 

Principles of Neurobiology 

Author: Liqun Luo 

New York, NY, USA: Garland Science (2015). 
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Forgive us for not disclosing the year, but 
when one of us was excited about the 
brain at a young age (and the other just 
born), neuroscience as an integrated 
discipline was still so new that one could 
hardly find a textbook. 

Neuroscience was then scattered in 
the standard medical curriculum of anat- 
omy, embryology, histology, biochem- 
istry, physiology, pharmacology, pathol- 
ogy, neurology, and psychiatry. Some 
teachers literally extracted relevant mate- 
rials from those traditional disciplines 
before adding a prefix “neuro-” to most 
of them. The exception was Harvard 
University, where Steve Kuffler and John 
Nicholls published From Neuron to Brain 
in 1976, basically using their teaching 
notes. For the progeny of Harvard who 
worshiped Kuffler, From Neuron to Brain 
was automatically embraced, though its 
style was highly personal and its content 
highly selective. Kuffler and Nicholls 
imprinted what they liked onto their stu- 
dents, who later spread the book when 
they established laboratories or depart- 
ments of neurobiology outside Harvard. 

When Eric Kandel and James Schwartz 
published Principies of Neurai Science 
in 1981 , it solved the problem of compre- 
hensiveness and became the standard 
neuroscience textbook. In its third edition 
in 1991, Tom Jessell strengthened the 
coverage on neural development, which, 
at that time, was taught mainly from 
Principies of Neurai Deveiopment by 
Dale Purves and Jeff Lichtman (1985). 
More textbooks, including Fundamentai 
Neuroscience by Squire et al.. Neurosci- 
ence by Purves et al., and Neuroscience 
Expioring the Brain by Bear et al., followed 
and each has been used by a significant 
readership. 

Why should anyone read Dr. Liqun 
Luo’s Principies of Neurobioiogyl Isn’t it 
suicidal to write a textbook in an already 
crowded market? 

The new book is distinct in its philoso- 
phy and authorship, which makes it valu- 



able for teaching and a pleasure to read. 
Luo shares with us the belief that students 
should not be taught just facts and 
knowledge but also how knowledge is 
obtained and how principles are derived. 
He advocates discovery-based teaching 
in his own course at Stanford and Peking 
Universities. This book is the product of 
his hard work to offer discovery-based 
teaching to more students. The target 
readership is senior undergraduate stu- 
dents interested in research, graduate 
students interested in neuroscience, and 
the minority of medical students who still 
intend to pursue research alongside clin- 
ical practice. 

It is an immense task for any individual 
to write a textbook for a discipline so large 
now that each recent annual meeting for 
the Society for Neuroscience in the United 
States alone has more than 30,000 at- 
tendees. If one wants to be sane and 
preserve one’s own time for research, 
one should ask others to join in writing a 
book. These partially explain why all major 
neuroscience textbooks are multi-auth- 
ored. Multiple authors also ensure exper- 
tise in diverse fields. Nevertheless, the 
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PRINCIPLES OF 
NEUROBIOLOGY 




advantage of single authorship is also 
obvious, as long as the author has the in- 
tellectual breadth and depth to cover 
different areas. A consistent writing style 
and a coherent, logical flow will certainly 
be helpful to readers. Luo has been heroic 
in writing the book over 5 years as a single 
author. In 13 chapters and 645 pages, he 
provides a masterpiece highlighting prin- 
ciples of how the nervous system trans- 
forms and governs our sensation, action, 
memory, and thought. 

Reading Principies of Neurobioiogy is 
like enjoying a beautiful piece of music. 
Luo often begins each section with inter- 
esting stories or classic experiments and 
ends with his comments on important 
but unsolved questions. We include a 
few stories here. 

Chapter 1 showcases nature versus 
nurture: a young chick exhibits innate 
escape behavior if an object flies toward 
it resembling a hawk but not when the 
same object flies toward it resembling a 
goose; juvenile barn owls, after wearing 
artificial prisms, could adjust their audi- 
tory map to match an altered visual 
map. Luo uses these stories to entice 
students to explore neurobiological 
mechanisms of behavior: how experi- 
ences remodel and sculpt neural circuits 
formed by genetic instruction. 

Chapters 2 to 1 1 form the core: 
signaling within neurons and across 
synapses, sensory/motor systems and 
how they wire together, sexual behavior, 
learning/memory and synaptic plasticity, 
and brain disorders. Unlike other neuro- 
science textbooks in which molecular, 
cellular, systems, and developmental 
neurobioiogy are treated separately, 
most topics in this book are question ori- 
ented with interdisciplinary approaches 
intertwined to address the same question. 
For example, in the section on olfaction, 
the author starts with an intriguing and 
remarkable story of salmon homing. A 
1976 field study demonstrates that adult 
salmon rely on the imprinted memory of 
smell to guide their homing to streams 
even after they have migrated many miles 
away. How are odor stimuli transformed 
into electrical signals? How do these sig- 
nals get processed, organized, and trans- 
mitted to the brain to form internal repre- 
sentation? How can some odors evoke 
stereotypic innate behaviors while others 
can be flexibly associated with learned 
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Image credit: Zhi Ye, Peking University. 

behaviors? How do distinct neural maps 
form during development and then how 
are they shaped by experiences? The 
author logically takes readers through 
the journey of major discoveries, colorfully 
filled with chemistry, molecular biology, 
genetics, electrophysiology, optical imag- 
ing, and behavioral assays. Instead of 
simply remembering facts or conclusions, 
students can have fun deriving conclu- 
sions from the original data. The pied 
piper is getting the student to follow with 
action. 

All sensory systems convert external 
stimuli into an internal representation in 
the brain, but they differ dramatically in 
the physical nature of the stimuli and the 
required processing. For example, vision 
has to convey color, as well as spatial in- 
formation, while the auditory system re- 
quires frequency separation and temporal 
precision. Are there the general principles 
shared between olfactory, visual, audi- 
tory, taste, and somatosensory systems? 



What are their unique mechanisms? 
Again, the discovery-based approach en- 
courages the reader to solve the puzzles 
one by one and arrive at a general under- 
standing of the sensory systems. This is 
but one of the many storylines that are 
thought provoking and enjoyable to read. 

The book ends with “evolution of 
the nervous system” (Chapter 12) and 
“ways of exploring” (Chapter 13). The 
evolution section asks how ion channels 
or sensory GPCRs evolved and when 
and how the mammalian neocortex un- 
derwent rapid expansion. It places neuro- 
biology in a perspective where diverse 
and independent strategies in different 
organisms were created to solve the 
common problem. 

The last chapter centers on technolo- 
gies in neurobiology. As Sydney Brenner 
remarked in 1980, “progress in science 
depends on new techniques, new discov- 
eries, and new ideas, probably in that 
order.” In fact, the career of the author 



himself partly echoes the comment. 
From the late 1990s to now, Luo has in- 
vented genetic methods in flies and mice 
to label and manipulate single neurons 
and to trace frans-synaptic connections, 
thus in an excellent position to provide 
an authoritative summary of neurotechni- 
ques, beyond those of optogenetics and 
CRISPR/Cas9. Regrettably, however, a 
bias is shown by the absence of biochem- 
ical approaches to neurobiology, which 
have played an important role in neurosci- 
ence such as the discoveries of the nerve 
growth factor and netrin. 

Taken together, Luo’s book is a pied 
piper’s flute for neuroscience students. 
As a single author taking up the challenge 
of discovery-based teaching, Luo has 
included a judicious choice of materials 
and achieved a fine balance of breadth 
and depth. Principle of Neurobiology will 
help teachers to inspire curious students. 
Once the students find neurobiology irre- 
sistible, they will find that no one can 
cover neuroscience in a single textbook, 
and motivated students can go on his/ 
her own path of learning more about 
neuroscience by reading other books 
that have upward of 1,700 pages and 
nearly 80 authors, specialized books 
such as Cognitive Neuroscience by 
Michael Gazzaniga, or better still original 
research papers. 

Most scientists are afraid of writing text- 
books, both because precious time can be 
spent on doing research and writing 
grants and because the reputation of au- 
thors of textbooks in the natural sciences 
almost always precedes, rather than 
follows, a textbook. Luo, through his 
Principles of Neurobiology, has set an 
example of how an outstanding scientist 
cares about education and writes a text- 
book to inspire students to pursue discov- 
eries. We hope that students will follow 
this pied piper, happily and fruitfully. 

Yulong LiT2,3 * and Yi RaoT2.3 * 

■'State Key Laboratory of Membrane 
Biology, Peking University School of Life 
Sciences, Beijing 100871, China 
2PKU-IDG/McGovern Institute for Brain 
Research, Beijing 100871, China 
^Peking-Tsinghua Center for Life Sciences, 
Beijing 100871, China 
*Correspondence: yulongli@pku.edu.cn 
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How Brain Fat Conquers Stress 
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A new paper by Bailey et al. reveals that lipid droplets, crucial organelles for energy storage, can 
also protect against oxidative stress. In Drosophila larvae, lipid droplets in glia allow neuronal 
stem cells to keep proliferating under hypoxic conditions. Protection likely involves sequestering 
vulnerable membrane lipids away from reactive oxygen species. 



Cells routinely encounter highly reactive 
oxygen derivatives, from endogenous 
and exogenous sources. These reactive 
oxygen species (ROS) mediate intracel- 
lular signaling but in excess cause severe 
damage to DNA, proteins, and lipids. 
Organisms combat such oxidative stress 
with a myriad of strategies, including 
small-molecule antioxidants and detoxi- 
fying enzymes. Somewhat paradoxically, 
increased ROS production also occurs in 
cells exposed to reduced oxygen levels 
(hypoxia), a particular problem for stem 
cells, as they tend to reside in hypoxic 
niches. A new paper in this issue of Cell 
(Bailey et al., 2015) uncovers a novel anti- 
oxidant strategy for stem cell protection. 
Here, glial cells harness fat storage organ- 
elles to protect neighboring neuronal stem 
cells, likely by sequestering vulnerable 
lipids away from membranes. This elegant 
paper sheds new light on lipid droplets, 
stem cells, and stress protection. 

Lipid droplets are the sites where cells 
store fat (Walther and Farese, 2012). 
Here, a layer of phospholipids and pro- 
teins surrounds a core of neutral lipids 
(e.g., triglycerides). Lipid droplets play 
central roles in lipid metabolism, storing 
energy, providing precursors for mem- 
brane lipids, and sequestering toxic lipids. 
Not surprisingly, droplet dysfunction is 
linked to many human diseases, including 
fatty liver, diabetes, and cardiovascular 
disease. 

Do lipid droplets also contribute to the 
complex lipid metabolism of the nervous 
system? So far, this possibility is poorly 
explored (Welte, 2015), though in adult 
flies and mice, inappropriate droplet 
accumulation in neurons or in glia is linked 
to neurodegeneration (Inloes et al., 2014; 
Liu et al., 2015). In the CNS of Drosophila 
larvae, droplets are abundant during 



normal development, though only in 
certain types of glia (Kis et al., 2015). 
Bailey et al. (2015) discovered that oxida- 
tive stress, as a result of hypoxia or ROS- 
generating chemicals, vastly increases 
droplet number specifically in the CNS. 

These larval glia serve as a niche for 
neuronal stem cells, the neuroblasts 
(Figure 1 , left). Under low-nutrient condi- 
tions, they secrete signals that allow 
neuroblasts to keep proliferating, even 
as Other cell lineages stop dividing (Cheng 
et al., 2011). Such “brain sparing” selec- 
tively protects the CNS, so that larvae 
can still develop into fully functional, 
albeit smaller, adults. By abolishing drop- 
lets specifically in glia, Bailey et al. (2015) 
now find that droplet accumulation pro- 
motes brain sparing under conditions of 
oxidative stress: glial lipid droplets allow 
neuroblasts to keep proliferating under 
hypoxic conditions. 

How does this protection work? One 
important type of ROS-induced damage 
is lipid peroxidation. Using a fluorescent 
sensor, the authors found that lipid perox- 
idation is higher in membranes than inside 
droplets and that membrane damage 
goes up dramatically when glia lack lipid 
droplets. Mass spectrometry revealed 
that, upon oxidative stress, fatty acids 
from membrane phospholipids are incor- 
porated into droplet-localized triglycer- 
ides. These observations suggest that, 
upon oxidative stress, vulnerable lipids 
are rerouted to lipid droplets where they 
are protected from ROS (Figure 1 , right). 

There is an interesting twist: although all 
types of fatty acids relocate to droplets to 
a similar extent, certain ones are selec- 
tively depleted from membranes, namely 
fatty acids with multiple double bounds, 
polyunsaturated fatty acids (PUFAs). 
PUFAs are particularly sensitive to lipid 



peroxidation, and thus the pool still in 
membranes is presumably preferentially 
damaged and lost. Glial droplets are 
induced particularly strongly on a high- 
PUFA diet, as if membranes with more 
vulnerable lipids require enhanced pro- 
tection. 

But is protection from lipid peroxidation 
responsible for continued neuroblast 
proliferation? Although a comprehensive 
answer will require identifying exactly 
how neuroblasts are compromised, avail- 
able evidence supports a mechanistic 
connection. First, on a high-PUFA diet, 
glial droplets are particularly important to 
sustain proliferation. Second, neuroblasts 
show less peroxidative damage if glia can 
make lipid droplets. Third, without glial 
droplets, neuroblasts display increased 
accumulation of 4-HNE, a product of lipid 
peroxidation that damages proteins. 
Exogenous 4-HNE induces ROS, sug- 
gesting a feedback loop that amplifies 
damage and allows its spread across 
the tissue (Figure 1 , right). 

In summary, this paper makes a 
compelling case that lipid droplets play 
an unanticipated, non-cell-autonomous 
role in a stem cell niche, allowing stem 
cells to proliferate even with oxidative 
stress. Sequestration of harmful lipids 
and proteins to protect the rest of the 
cell is a known function of lipid droplets 
(Walther and Farese, 2012; Welte, 2015); 
in the proposed model, sequestration 
protects vulnerable lipids from a noxious 
environment. Yet how lipid droplets pro- 
vide a safe haven for PUFAs remains 
unknown. Is the neutral lipid core simply 
inaccessible to polar ROS or do co-accu- 
mulating lipids or proteins act as antioxi- 
dants? Knowledge about composition, 
cell biology, and biophysics of lipid drop- 
lets is exploding; e.g., artificial droplet-like 
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Figure 1 . Lipid Droplets Enable Neuroblast Proliferation in Hypoxic Conditions 

Left: during normal development, neuroblasts divide asymmetrically, regenerating the neuroblast and generating daughters that will give rise to neurons. 
Adjacent glial cells provide the stem cell niche for the neuroblasts. Right: a magnified view of the glia/neuroblast interface shows the lipid bilayers that make up the 
plasma membranes of the glia and neuroblast cells. Lipids containing PUFAs are shown in green; lipids damaged by peroxidation are in red. At normal oxygen 
levels, neuroblasts keep dividing. Linder hypoxic conditions and in the absence of lipid droplets, ROS induces lipid peroxidation in membranes, which in turn gives 
rise to 4-HNE adducts on proteins. 4-HNE causes more ROS, resulting in an escalating feedback loop. The accumulating damage brings neuroblast proliferation 
to a halt. When glial lipid droplets are present, hypoxic conditions lead to a redistribution of membrane lipids (including PUFA-containing ones) to lipid droplets 
where they are protected from ROS-induced damage. Because membranes now contain fewer targets for lipid peroxidation, the feedback loop is dampened, and 
overall damage remains mild enough for neuroblasts to continue proliferating. It is not fully resolved if hypoxic remodeling lowers the relative concentration of 
PUFAs in the membrane (e.g., by reducing the ratio of lipids to proteins) or if total membrane surface shrinks. 



structures can now be generated in vitro 
(Thiam et al., 2013). Thus, answers to 
these questions are within reach. 

One central mystery is that all kinds of 
fatty acids are rerouted from membranes 
to lipid droplets, not just the most vulner- 
able PUFAs. Is this bulk movement neces- 
sary for protection? It also implies that 
fewer phospholipids remain in the cellular 
membranes. Whether this reduces overall 
membrane surface or alters membrane 
composition is unknown. Probing mem- 
brane properties after hypoxic remodeling 
will likely provide insightful. It will also be 
revealing whether membrane lipids from 
neuroblasts end up in glial droplets or if 
potential ROS targets are only eliminated 
from glial membranes. 

Oxidative stress is a severe, constant 
challenge for organisms, contributing to 
many pathologies as well as to functional 
decline in aging. I therefore suspect that 
this newly discovered antioxidant func- 
tion is deployed widely, especially since 
lipid droplets are ubiquitous. In mamma- 
lian cancer cells, lipid droplets are indeed 



known to enhance cell proliferation 
upon hypoxia (Bensaad et al., 2014). In 
adult flies and mice, neurodegenera- 
tion induces glial lipid droplets in a ROS- 
dependent manner, but here, presence 
of droplets correlates with more severe 
phenotypes (Liu et al., 2015)— in stark 
contrast to the beneficial effect on larval 
neuroblasts. This discrepancy might arise 
from context-specific roles of glial lipid 
droplets, with different lipid pathways 
sustaining proliferating neuroblasts and 
postmitotic neurons. 

Since both too little and too much ROS 
can be harmful, antioxidants levels are 
generally carefully regulated. This prece- 
dent suggests that droplets’ antioxidant 
properties are tunable and that, in pa- 
tients with reduced or excessive lipid stor- 
age, imbalanced ROS may contribute to 
disease. 
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An effective immune response leads to rapid elimination of infectious agents, with seemingly little 
long-term impairment to the host. New findings by Morals da Fonseca et al. reveal that acute infec- 
tions may result in permanent disruption of tissue homeostasis and immune dysfunction, long after 
clearance of a pathogen. 



Throughout the course of one’s lifetime, 
an individual will experience numerous 
episodes of acute and chronic microbial 
infection — including, but not limited to, 
gastrointestinal and respiratory infec- 
tions. Exposure to certain pathogens has 
been shown to have profound, long-last- 
ing consequences on the function of 
the host immune responses (Medzhitov, 
2008, 2010), with some chronic infections 
proposed as triggers for inflammatory 
disorders such as multiple sclerosis, 
lupus, and inflammatory bowel disease 
(IBD) (Karin et al., 2006; Wucherpfennig, 
2001). In contrast, acute infections are 
typically considered self-limiting without 
engendering long-term immune conse- 
quences to the host, though resolution 
nevertheless requires an inflammatory 
response and disruption of tissue homeo- 
stasis. In this issue of Cell, Morais da 
Fonseca et al. reveal that a single acute 
infection with the enteric pathogen Yersi- 
nia pseudotuberculosis results in a per- 
manent dysregulation of tissue-specific 
immunity (Morais da Fonseca et al., 
2015). Long after clearance of Yersinia, 
the infection appears to cause persistent 
impairment of the lymphatic network, 
disrupting molecular and cellular interac- 
tions between the gut and mesenteric 
lymph nodes (MLN), resulting in impaired 
mucosal immune responses (Figure 1). 
This exciting new study reveals that acute 
infections may contribute to long-term 
disruption of tissue homeostasis, pro- 
moting chronic immune dysfunction and 
disease. 

While most interactions between mi- 
crobes and animals are symbiotic in 
nature, the immune system has been 
shaped by an evolutionary “arms race” 
with pathogens. Examples of chronic in- 
fections resulting in persistent inflamma- 



tion and localized tissue damage include 
Mycobacterium tuberculosis (tubercu- 
losis), Helicobacter pylori (peptic ulcers 
and gastric cancer), hepatitis B virus and 
hepatitis C virus (chronic hepatitis and 
cirrhosis), and Borrella burgdorferi (Lyme 
disease) (Karin et al., 2006; Wucherpfen- 
nig, 2001). However, there are fewer ex- 
amples illustrating how acute infections 
may be linked to permanent alterations 
to host immunity. Rheumatic fever, an in- 
flammatory disease involving the heart 
and joints, develops following an acute, 
self-limiting infection of the pharynx with 
group A Streptococcus pyogenes. A 
pathological consequence of molecular 
mimicry, the immune response to strepto- 
coccal M protein elicits cross-reactivity to 
human protein antigens, specifically 
myosin, tropomyosin, and laminin in car- 
diac myocytes. However, whether acute 
infections permanently alter general im- 
mune function has remained unclear, as 
has evidence that a pathogen can trigger 
chronic inflammatory diseases after its 
clearance by the immune system. Here, 
Morais da Fonseca et al. report that 
after an acute gastrointestinal infection 
with Y. pseudotuberculosis, 70% of mice 
develop chronic mesenteric lymphade- 
nopathy (CL"^) despite clearance of Yersi- 
nia. In a series of elegant experiments, the 
authors reveal that infection-induced tis- 
sue damage and remodeling of mesen- 
teric lymphoid tissues promotes impaired 
integrity of the lymphatic network, as 
shown by leakage of fluorescent BODIPY 
from lymphatic vessels into the surround- 
ing mesenteric adipose tissue (MAT). As 
a result of increased lymphatic perme- 
ability, migratory CDIOS'^CDI lb"" DCs 
were shown to accumulate in the MAT 
compartment rather than transiting to 
MLNs as expected (Figure 1), thereby 



blunting robust mucosal immune re- 
sponses. Notably, the defect in lymphatic 
integrity persisted at least nine months 
post- Yers/n/a infection, suggesting dura- 
ble pathologic consequences well after 
clearance of the pathogen. A similar 
finding was previously reported where 
respiratory tract infection with Myco- 
plasma pulmonis led to persistent lym- 
phangiogenesis and reduced transport 
of antigen presenting cells from the air- 
ways to draining lymph nodes (Baluk 
et al., 2005). Morais da Fonseca and co- 
workers extend these previous concepts 
to reveal a mechanism whereby tissue 
damage caused during acute gastrointes- 
tinal infection impairs proper mucosal 
immune function. Specifically, pathogen- 
induced reduction in trafficking of migra- 
tory gut DCs to the MLN compromises 
the ability of these cells to promote oral 
tolerance and protective immunity to oral 
vaccination. 

The authors proceed to show that 
the microbiota is required to sustain 
inflammation after resolution of Yersinia. 
Germ-free (GF) mice infected with 
Y. pseudotuberculosis developed lymph- 
adenopathy, similar to specific-pathogen 
free (SPF) mice. However, migration dy- 
namics of CDIOS'^CDIIb^ DCs from the 
lamina propria to MLNs remained unaf- 
fected in GF mice, suggesting that Yersi- 
nia infection causes mislocalization of 
DCs to the MAT via effects on the micro- 
biota. Antibiotic treatment depleting the 
microbiota in SPF mice was sufficient to 
reduce cell infiltration into the surrounding 
MAT, restoring proper DC trafficking 
and robust mucosal immunity, further 
implicating the microbiota as a vehicle 
for sustained immune dysregulation 
following resolution of acute infection. It 
is important to note, however, that the 
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Figure 1. Proposed Model for Durable Inflammatory Responses Following Acute Infection 

Under normal conditions (left), migratory CD1 03"^ CD1 1 DCs recognize and internalize antigens from the 
gut and traffic to MLNs via lymphatic vessels. In the MLNs, CD1 03"^ CD1 1 b'^ DCs induce TREG and TH1 7 
responses to promote oral tolerance and protective immunity to oral vaccination. Following the clearance 
of Yersinia pseudotubercuiosis (right), infection-induced tissue damage leads to increased permeability of 
the lymphatic network. As a result, migratory DCs accumulate in the MAT compartments rather than 
trafficking to MLNs, blunting TREG and TH17 responses. The microbiota sustains inflammation following 
clearance of Yersinia, as enhanced exposure to microbial ligands occur due to increased permeability of 
the lymphatic system. MLN, mesenteric lymph nodes; MAT, mesenteric adipose tissues. 



development of chronic lymphadenopa- 
thy was not associated with either a 
bloom of pathogenic bacterial strains 
(pathobionts) or dysbiosis of the gut mi- 
crobiota, as lymphadenopathy was not 
transmissible via co-housing or fecal 
transplant into uninfected GF mice. 
Rather, it was likely due to enhanced 
exposure of the immune system to gut mi- 
crobial ligands as a result of defective 
lymphatic integrity, as treatment with bac- 
terial products alone was able to promote 
and sustain inflammation following enteric 
infection. 

As features of chronic inflammatory dis- 
eases typically mirror immune responses 
to acute infection, identification of a direct 
link between a causal infectious agent and 
chronic disease has been challenging. For 
Crohn’s disease, a chronic inflammatory 
condition of the gastrointestinal tract, 
many, including Crohn and colleagues, 
have postulated an inciting infectious 



agent may be responsible for the initiation 
of disease (Crohn et al., 1 984; Hugot et al., 
2003; Sartor, 2005). Despite extensive 
research efforts, a causative organism(s) 
for Crohn’s disease has yet to be repro- 
ducibly validated, though this issue re- 
mains controversial (Greenstein, 2003). 
Perhaps chronic inflammation observed 
in Crohn’s patients may be the result of 
persistent effects of a previously encoun- 
tered infection that caused a long-lasting 
impairment of the immune system. The 
study by Morais da Fonseca et al. pro- 
vides a novel framework to examine how 
a cleared infection may induce permanent 
tissue remodeling and altered immune re- 
sponses, contributing to chronic disease 
later in life. However, outstanding ques- 
tions remain despite these comprehensive 
advances; namely the mechanism by 
which Yersinia promotes chronic lymph- 
adenopathy remains unresolved. What 
cellular and molecular changes to the 



host impair integrity of lymphatic vessels? 
Are there specific bacterial factors that are 
required for this outcome? Further, it is un- 
known why certain populations of mice 
undergo lymphatic remodeling, whereas 
others (cage mates) are protected from 
lymphadenopathy. Not surprisingly, the 
outcomes uncovered by this new study 
are unlikely to be specific to Yersinia path- 
ogenesis as the authors showed Toxo- 
piasma gondii infection, as well as T cell 
transfer of colitis, are both sufficient to 
promote chronic lymphadenophathy. 
These data suggest that chronic inflam- 
matory diseases may be caused by acute 
infections and are a consequence of a 
generalized host response. Collectively, 
the innovative findings from Morais da 
Fonseca and coworkers reveal an 
intriguing hypothesis whereby remodeling 
of the immune system by an acute infec- 
tion may cause chronic tissue-specific 
damage and symptoms associated with 
IBD and autoimmunity and may help 
explain a possible etiology of various idio- 
pathic inflammatory conditions. 
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Gut bacteria are known to affect immune cell development, but most intestinal lymphocytes have 
no direct contact with luminal bacteria. Two studies by Atarashi et al. and Sano et al. shed light on 
how bacterial adhesion can cue intestinal epithelial cells to direct differentiation of gut T cells. 



The mammalian intestine houses millions 
of bacteria, which under homeostatic 
conditions cooperate with host cells to 
facilitate normal metabolism, immune 
system development, and pathogen 
resistance. This is largely conducted in a 
contact-independent manner via bacte- 
rial metabolites, since the majority of in- 
testinal bacteria are non-adherent and 
do not cross the dense mucous layer 
that overlies the epithelium. For instance, 
short-chain fatty acids produced by 
certain bacteria in the lumen of the colon 
readily diffuse across epithelial cell mem- 
branes to be directly utilized by epithelial 
cells themselves or to instruct differentia- 
tion of gut immune cells, particularly 
regulatory T cells (Smith et al., 2013). In 
addition, unique microbial molecular pat- 
terns, such as fragments of bacterial cell 
walls, are sensed by immune cells to 
trigger secretion of soluble factors and 
amplification of immune responses at 
sites of infection. At the interface between 
the microbe-rich lumen and the underly- 
ing immune-rich lamina propria, intestinal 
epithelial cells may therefore serve as 
major transmitters of microbe-derived 
immunomodulatory signals in the gut. In 
this issue of Cell, Atarashi et al. (2015) 
and Sano et al. (2015) now elucidate 
how bacterial adhesion to intestinal 
epithelial cells can direct the functional 
differentiation of gut Th17 cells. 

Production of the cytokine IL-17, the 
defining characteristic of Th17 cells, has 
been shown to occur in CD4+ T cells 
in protective responses against several 
extracellular mucosal pathogens such 
as Candida albicans and CItrobacter 
rodentium (Conti et al., 2009; Mangan 
et al., 2006). Indigenous gut bacteria are 
required to induce steady-state Th17 dif- 



ferentiation as evidenced by the near 
absence of Th17 cells in germ-free ani- 
mals (Ivanov et al., 2009). Through sys- 
tematic colonization of germ-free mice 
with various bacterial isolates, Gaboriau- 
Routhiau et al. found commensal 
segmented filamentous bacteria (SFB) to 
be sufficient to activate various effector 
CD4+ T cell responses in the gut, particu- 
larly Th17 responses (Gaboriau-Routhiau 
et al., 2009). By comparing the microbiota 
of Thi 7-sufficient mice (Taconic) to that of 
naturally Thi 7-insufficient mice (Jack- 
son), Ivanov and coworkers simulta- 
neously identified SFB as the major 
microbial determinant of Thi 7 induction 
in conventional Taconic C57BL/6 mice 
(Ivanov et al., 2009). However, it has re- 
mained unclear what characteristic of 
these microbes elicits IL-17 expression 
and how Thi 7 cells arise in the absence 
of pathogenic infection. 

In this new study, Sano et al. (201 5) uti- 
lized SFB-specific transgenic T cells and 
MHCll tetramers to track the maturation 
of SFB-specific CD4+ T cells temporally 
along the gastrointestinal tract after acute 
colonization. They observed that, while 
“poised” transgenic T cells expressing 
the Thi 7 master regulator RORyt could 
be found throughout the gut, they were 
most abundant in the small intestine 
ileum. The ileum was also the location 
with the highest level of SFB colonization, 
and it was only here that transgenic T cells 
robustly expressed IL-17. In their seminal 
study, Gaboriau-Routhiau et al. previ- 
ously noted that the most robust cytokine 
induction occurred in colonization condi- 
tions that included SFB adherence 
to Peyer’s patches (Gaboriau-Routhiau 
et al., 2009). Now, in elegant complemen- 
tary studies, Atarashi et al. (2015) show 



this preferential IL-17 induction to be 
highly dependent on enhanced SFB 
adhesion to the small intestine epithelium, 
particularly the follicle-associated epithe- 
lium. By inoculating mice with a rat strain 
of SFB that is equally efficient at colo- 
nizing but unable to adhere to small intes- 
tine epithelial cells, they demonstrated 
that SFB colonization itself was not 
sufficient to induce Thi 7 cells in the 
absence of adhesion. They went on to 
establish that other adherent microbes 
(C. rodentium, EHEC, and C. albicans) 
are also able to promote IL-17 production 
and that this property is lost when these 
microbes are rendered non-adherent. 

Ivanov et al. previously showed that 
Thi 7 induction was associated with in- 
creased expression of all three isoforms 
of serum amyloid A (SAA) in the terminal 
ileum (Ivanov et al., 2009). However, the 
vital role of epithelial cell-derived SAA 
expression in Thi 7 differentiation has 
only now been revealed. Using SAA-defi- 
cient animals and an in vitro Thi 7 dif- 
ferentiation system, Sano et al. (2015) 
determined that SAA was both necessary 
and sufficient to directly promote IL-17 
production and proliferation in RORyt+ 
CD4+ T cells. Atarashi et al. (2015), on 
the other hand, described SAA-depen- 
dent production of Th17-promoting cyto- 
kines by CD11c+ myeloid cells and an 
additional requirement for adhesion-eli- 
cited epithelial cell production of reactive 
oxygen species. Although neither study 
focused on the identification of the 
myeloid lineage responsible for these ef- 
fects, other recent studies have uncov- 
ered a critical role for monocyte-derived 
CXsCRI^ cells in the generation of SFB- 
specific Thi 7 responses (Panea et al., 
2015). Additionally, the Littman group 
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Figure 1. Bacterial Adhesion to Intestinal 
Epithelial Cells Forges Gut T Cell Function 

In response to attachment of SFB or other adherent 
bacteria (the “hammer”), intestinal epithelial cells 
upregulate and release serum amyloid A (SAA). 
SAA boosts myeloid cell (presumably CX3CR1 + 
mononuclear antigen-presenting cells) production 
of IL-1 b and perhaps IL-23. IL-1 b can amplify SAA 
expression by signaling directly to the lECs, and 
IL-1b and IL-23 promote production of IL-22 in 
type 3 innate lymphoid cells (ILC3). IL-22, in turn, 
reinforces SAA expression by lECs. This circuitry 
promotes IL-1 7 expression in “poised” RORgt+ 
CD4+ T cells. 

demonstrated that CXsCRI"^ cell-derived 
lU-p and IL-23 induces ILC3 production 
of IL-22 (Longman et al., 2014). While 
both Sano et al. (2015) and Atarashi 
et al. (2015) identify a requirement for 
SFB-induced ILC3 production of IL-22 in 
potentiating SAA expression by epithelial 
cells, it is still unclear whether IL-22 is suf- 
ficient to prompt SAA-mediated IL-1 7 
expression. It is therefore plausible that 
CXsCRI"^ cells are uniquely reactive to 
SAAs, possibly by preferential expression 



of SAA receptors or lipoprotein particle 
receptors, and are thereby specialized to 
amplify epithelial cell SAA expression 
(via IL-lp) and contribute to Th17 induc- 
tion. These Th17-inducing effects of 
SAAs may also be balanced by a possible 
regulatory role of retinol-binding SAAs 
(Derebe et al., 2014), opening up the pos- 
sibility that the nature of co-factors car- 
ried by SAA may influence regulatory 
versus effector T cell function in the gut. 

Much like the sledgehammer forging 
steel (Leon Trotsky), it is possible that 
epithelial cells are exquisitely responsive 
to cytoplasmic perturbations induced by 
physical and mechanical cues. Hence, 
bacterial adhesion (the “hammer,” Fig- 
ure 1) may shape downstream signaling, 
such as Stat3 phosphorylation (Sano 
et al., 201 5) or C/EBP6 upregulation (Atar- 
ashi et al., 2015), via mechanotransduc- 
tion mechanisms. Although neither the 
intracellular signaling pathways triggered 
upon bacterial adhesion nor the exact up- 
stream events that lead to SAA secretion 
are fully understood, these studies define 
a new pathway of epithelial cell sensing of 
intestinal microbes and modulation of 
adaptive immune responses, highlighting 
the importance of local environmental 
cues in licensing effector cytokine pro- 
duction. This could have important impli- 
cations to the physiological functions of 
Th17 cells in local control of pathogenic 
infections and systemic autoimmune re- 
sponses. 
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Berchowitz et al. establish that transient amyloid-like forms of Rim4, a yeast RNA-binding protein 
with a predicted prion domain, translational ly repress cyclin CLB3 in meiosis I, thereby ensuring 
homologous chromosome segregation. These findings suggest that prion domains might enable 
formation of tightly regulated amyloid-like effectors in diverse functional settings. 



Meiosis yields haploid gametes from 
diploid progenitors via two consecutive 
rounds of chromosome segregation, 
meiosis I and II, following a single S 
phase. During meiosis I, homologous 
chromosomes segregate, whereas in 
meiosis II sister chromatids separate. 
Proper meiotic progression requires 
translational regulation of specific cyclins, 
such as the B-type cyclin, CLB3 in 
budding yeast, which is repressed during 
meiosis I but translated during meiosis II 
(Carlile and Amon, 2008). Aberrant 
expression of CLB3 during meiosis I pre- 
vents the essential segregation of homol- 
ogous chromosomes and triggers prema- 
ture sister chromatid separation (Carlile 
and Amon, 2008). In this issue of Cell, an 
elegant study by Berchowitz et al. (2015) 
now establishes that Rim4, a meiosis- 
specific RNA-binding protein (RBP) with 
a predicted prion domain (Figure 1A) (Al- 
berti et al., 2009), forms functional amy- 
loid-like conformers that directly repress 
CLB3 translation during meiosis I (Ber- 
chowitz et al., 2015). 

Rim4 is a 713-residue RBP with three 
N-terminal RNA-recognitions motifs 
(RRMs) and a C-terminal low-complexity 
(LC) domain predicted to be predomi- 
nantly disordered, which harbors a puta- 
tive prion domain (Figure 1A) (Alberti 
et al., 2009). Prion domains have an 
unusual, low-complexity amino acid 
composition comprised of predominantly 
uncharged polar amino acids (particularly 
glutamine, asparagine, and tyrosine) and 
glycine (Alberti et al., 2009; King et al., 
2012). In canonical yeast prion proteins, 
the prion domain can switch from an 



intrinsically disordered conformation to 
an infectious cross-p conformation (Al- 
berti et al., 2009; King et al., 2012). In the 
present study, the authors showed that 
in isolation, Rim4 spontaneously assem- 
bled into p sheet-rich fibrils, and that 
Rim4 fibrillization was RNA-independent 
and required the C-terminal LC domain 
(Berchowitz et al., 2015). Moreover, 
in vivo, the predicted prion domain plus 
C-terminal determinants in the LC domain 
of Rim4 mediated formation of stable, 
SDS-resistant aggregates during pre- 
meiotic S phase, which sequestered spe- 
cific mRNAs and repressed cyclin CLB3 
translation exclusively during meiosis I 
(Figure IB) (Berchowitz et al., 2015). 
Indeed, Rim4 variants bearing C-terminal 
deletions in the LC domain were unable 
to form amyloid-like species or elicit 
translational repression in this context 
(Berchowitz et al., 2015). Remarkably, at 
the onset of meiosis II, increased Ime2 ki- 
nase activity triggers rapid degradation of 
amyloid-like Rim4 to enable CLB3 trans- 
lation and sister chromatid segregation 
(Figure IB) (Berchowitz et al., 2015). 
Collectively, these data suggest that the 
formation and elimination of amyloid-like 
Rim4 is exquisitely timed and tightly 
orchestrated to enable a specific transla- 
tional repression modality critical for 
meiosis I. 

Is this mode of regulation specific to 
yeast? The answer is probably no. In 
fact, humans possess four meiosis-spe- 
cific RBPs with predicted prion domains, 
DAZ1-4 (King et al., 2012). Moreover, a 
related RBP, DAZL (DAZ-like), is crucial 
for gametogenesis in mice. In the current 



work, the authors observed that DAZL 
forms SDS-resistant aggregates in cells 
undergoing meiosis but not in non- 
meiotic tissue (Berchowitz et al., 2015). 
Thus, similar to Rim4 in yeast, an amy- 
loid-like form of DAZL, and perhaps 
also DAZ1-4, might play a role in 
translational regulation during mamma- 
lian gametogenesis. 

Rim4 is not the first example of amy- 
loid-like species regulating mRNA trans- 
lation. However, its repression of CLB3 
translation stands in stark contrast to 
the translational activation of specific 
transcripts sparked by amyloid-like 
forms of a neuronal cytoplasmic polya- 
denylation element binding protein from 
Aplysla callfornlca (ApCPEB) (Shorter 
and Lindquist, 2005; Si et al., 2010). 
ApCPEB also harbors a prion domain, 
which drives assembly of amyloid-like 
forms of ApCPEB in response to specific 
neurotransmitter cues that stimulate a 
translational program that enables syn- 
aptic robustness and long-term facilita- 
tion (Shorter and Lindquist, 2005; Si 
et al., 2010). Like Rim4, the amyloid-like 
form of ApCPEB engages specific 
RNAs more avidly (Berchowitz et al., 
2015; Shorter and Lindquist, 2005). This 
shared ability of amyloid-like forms of 
Rim4 and ApCPEB to modulate mRNA 
translation suggests that this type of 
regulation may be more widespread 
than presently appreciated. Remarkably, 
~1% of human proteins harbor a pre- 
dicted prion domain, including at least 
~50 RBPs (Kim et al., 2013; King et al., 
2012; Li et al., 2013). The pervasiveness 
of these domains is cause to ask— is 
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Figure 1. Amyloid-like Forms of Rim4 Repress CLB3 Translation During Meiosis I 

(A) Domain architecture of Rim4. Rim4 is divided into an N-terminai RNA-binding domain (red) harboring 
three RNA-recognition motifs (RRM1-3) and a C-terminai iow-compiexity (LC) domain (biue) harboring a 
predicted prion domain. 

(B) Rim4 expression is induced in premeiotic S phase, and the LC domain of Rim4 (biue) enabies assembiy 
into amyloid-iike species. Upon entry into meiosis i, amyioid-iike Rim4 represses the transiation of cyciin 
CLB3 by binding the 5'UTR of the CLB3 mRNA (purpie). Repression of CLB3 transiation enabies segre- 
gation of homoiogous chromosomes. Upon transition to meiosis ii, ime2 kinase activity increases and 
eiicits rapid degradation of amyioid-iike Rim4, which reiieves inhibition of CLB3 transiation thereby 
faciiitating sister chromatid separation. 



the formation of amyloid or amyloid-like 
states a widely employed mechanism of 
cellular regulation underlying transla- 
tional control and perhaps a spectrum 
of other molecular processes? 

It is important to note that amyloid or 
amyloid-like conformers are not the only 
structures encoded by predicted prion 
domains. Indeed, prior to populating 
self-templating amyioid-iike states, these 
intrinsically disordered regions can un- 
dergo concentration-dependent liquid 
demixing phase transitions. These transi- 
tions enable partitioning of various dy- 
namic compartments with liquid- or gel- 
like properties, including P bodies, stress 
granules, and paraspeckles (Hennig et al., 
2015; Li et al., 2013; Patel et al., 2015). 
Despite the clear utility of both preamyloid 
and amyloid-like conformations encoded 
by predicted prion domains, this type of 



domain is a double-edged sword. Indeed, 
RBPs with predicted prion domains, 
including FUS, TDP-43, hnRNPAI, and 
hnRNPA2, have emerged as causative 
agents in several fatal neurodegenerative 
disorders such as amyotrophic lateral 
sclerosis and frontotemporal dementia 
(Kim et al., 2013; King et al., 2012; Li 
et al., 2013). Here, the predicted prion do- 
mains enable RBP misfolding into delete- 
rious oligomeric and fibrillar structures 
that likely underpin disease, and dis- 
ease-linked mutations can accelerate 
these misfolding events (Kim et al., 2013; 
King et al., 2012; Li et al., 2013; Patel 
et al., 2015). In particular, an aberrant 
phase transition from liquid- or gel-like 
states to more intractable solid phases 
mediated by the prion domain is likely a 
key pathogenic event (Li et al., 201 3; Patel 
et al., 2015). 



A critical goal is to accurately under- 
stand what primary sequence elements 
and auxiliary factors determine whether 
predicted prion domains assemble into 
functional, beneficial structures or delete- 
rious, misfolded conformers that are 
pathogenic. Such understanding will 
empower the design of therapeutics to 
prevent or reverse detrimental RBP mis- 
folding trajectories. In this context, a 
mechanistic understanding of the devel- 
opmentally regulated formation and elim- 
ination of amyloid-like Rim4 is likely to be 
extremely valuable. For example, the 
mechanism by which amyloid-like Rim4 
is rapidly degraded during meiosis II re- 
mains uncharted. It is not clear how 
increased Ime2 kinase activity elicits 
this degradation process or whether it 
requires protein disaggregases (e.g., 
HspllO or Hsp104), proteasomes, or 
autophagy. Delineation of this temporally 
restricted clearance pathway could 
inform effective strategies to eliminate 
pathogenic amyloid conformers. 
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The digital reconstruction of a slice of rat somatosensory cortex from the Blue Brain Project pro- 
vides the most complete simulation of a piece of excitable brain matter to date. To place these 
efforts in context and highlight their strengths and limitations, we introduce a Biological Imitation 
Game, based on Alan Turing’s Imitation Game, that operationalizes the difference between real 
and simulated brains. 



The much awaited opus magnum of the “Blue Brain Project” 
(“BBP”) is appearing in this issue of Cell (Markram et al., 
201 5j. The brain child, so to speak, of the electrophysiologist 
Henry Markram, the BBP is a large-scale, international collabo- 
rative effort that seeks to simulate a synthetic brain using biolog- 
ically realistic models of synapses and nerve cells. The BBP is 
an exceptional (neuro)-engineering effort based on a bold and 
long-term vision. To be successful, it requires meticulous atten- 
tion to detail, a highly interdisciplinary team, and large-scale 
and stable resources. Financially, it is backed by the Swiss gov- 
ernment, with more recent contributions by the pan-European 
Human Brain Project. 

Initiated a decade ago, the ambitious aims of the BBP are to 
amass all relevant knowledge pertaining to the mammalian brain; 
to distill the associated information into an integrated, standard- 
ized, and open-access database; and to endow this massive yet 
static entity with life by simulating spontaneous and sensory- 
driven synaptic activity and the associated cellular electrical 
responses using partial differential equations. The Cell paper 
describes a first-draft digital reconstruction of a section of rat 
somatosensory cortex. It is, without doubt, the most complete 
simulation to date of a piece of excitable brain matter. 

The utility of such massive simulations has been much 
debated — in particular, as they relate to the ill-defined goal of 
“understanding the brain” that is on the masthead of the Human 
Brain Project and the other large-scale brain projects initiated 
over the last 3 years (Kandel et al., 2013). So before we dive 
into the details, let us step back and consider how best to think 
about this category-defying model. 

Introducing a Biological Imitation Game 

In an effort to circumvent the question “can machines think”?, 
the mathematician Alan Turing (Turing, 1950) introduced “The 
Imitation Game” in which an interrogator is tasked with deter- 
mining the identity of two examinees as “male” or “female.” 
This game is imagined to be played by human beings. Turing 
introduced as a measure of the thinking ability of machines 
the length of time the Interrogator needed to play before realizing 
that one of the other players had been replaced by a machine. 
The longer this takes, the more this machine could be argued 
to operationally behave like a thinking human. 

Let us imagine our own Imitation Game, in which a lead inves- 
tigator running a lab is recording from neocortical neurons in a 



rat but has also devised a sophisticated computer model of its 
electrical behavior. Being quite busy with grant writing and other 
administrative tasks, she rarely enters the lab, relying instead on 
reports from her students and postdocs. 

After some discussion, they agree on a protocol X— say, stim- 
ulate sensory input into somatosensory cortex while recording 
from layer 5 pyramidal neurons. The experimentalists in her lab 
judiciously carry out these manipulations and bring her a set of 
plots— extra-cellular recordings, current source density plots, 
and so on. Their computational colleagues do likewise and 
generate the same type of graphs. Let’s call these, respectively, 
A(X) and B(X). 

For the purposes of this operational definition, we assume 
that the investigator does not know whether A or B is the 
experimental data, taken here as ground truth. Both look super- 
ficially similar. To distinguish between the real data, A(X), and 
the simulated data, B(X), the investigator must devise clever ex- 
periments for her group to carry out on both, probing for the 
simulation’s failure modes. Clearly, the longer that it will take 
the investigator, given her accumulated knowledge and insights 
about real brains, to confidently identify the provenance of the 
data, the more the model can be said to mimic the real system. 
We take as a measure of the quality of the simulation the length 
of time it takes her to confidently identify the provenance of 
the data and the level of sophistication of question required 
to do so. 

Thus, the question of how good any one computer simulation 
of reality is can be replaced by an operational measure of how 
long an expert can be fooled by the simulation. As the model 
becomes more and more sophisticated, this will take longer 
and longer. In the limit of a completely faithful digital simulacrum, 
reliably judging which is real and which is synthetic will become 
impossible. Note that, in the spirit of Turing, our operational 
definition bypasses eristic discussions of what is meant by 
“understanding” the brain. 

Looking Under the Hood 

We will now summarize the approach taken by the BBP team. The 
model simulates a segment of 2-week-old rat somatosensory 
cortex, 2 mm tall and 210 iim in radius, about the size of 
a cortical column. The individual nerve cells are derived from 
the Markram laboratory’s heroic efforts over almost two decades 
to record and label more than 1 4,000 neurons in slices taken from 
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Bachelor #1 : What is your favorite neurotransmitter to release 
during spreading waves? Illustration by Phil Lesnar, courtesy of 
the Allen Institute. 

young male Wistar rats (e.g., Markram et al., 1 997 and 2004). The 
dendritic trees and local axonal morphologies of 1 ,009 of these 
cells are completely digitally reconstructed in 3D. The cells fall 
into 55 morphological classes, as determined by the layer in 
which the cell bodies were located and their morphology: 
42 GABAergic and 13 glutamergic types. This is roughly in line 
with the literature (Harris and Shepherd 2015). The electrical 
properties of these neurons are inferred by recording from the 
cell bodies of 3,900 neurons in young rat slices. Using a standard 
electrophysiological protocol, 11 types of firing patterns— ten 
associated with inhibitory and one (continuous adapting) 
with excitatory cells— can be distinguished. From a combined 
dataset of 51 1 morphologically and electro-physiologically char- 
acterized neurons, the BBP team infers 207 morpho-electrical 
types (207 out of a possible 55 x 1 1 = 605 cell types). 

Next, the team populates their column with 31 ,320 neurons 
drawn from these 207 types, with multiple clones of each cell 
type randomly and independently spatially distributed across 
five layers (LI, L2/3, L4, L5, and L6) in accordance with 
measured cell densities and immunohistochemical staining. 

The electrical behavior of each neuron is modeled using 
a classical Hodgkin-Huxley formalism extended to include 13 
voltage- and calcium-dependent conductances inserted into 
the membrane at the cell body and the proximal and distal den- 
dritic tree (Hay et al., 2011). 

At this stage, these neurons with their thick dendrites and 
thin non-myelinated axons are scattered throughout the cylin- 
drical volumes, a dense jungle with roots, flowering plants, 
brushes, trees, and their canopy intertwined. Both cortex and 
forest share a predominantly vertical organization. They differ 
as only the former have highly specialized junctions among 
their members, chemical synapses that convert electrical activ- 
ity in presynaptic axonal boutons into neurotransmitter release 



and back into electrical activity at postsynaptic sites on spines, 
dendrites, or cell bodies. 

The Connectomics Algorithm 

But where to place the all-important synapses? Here, we come 
to the beating heart of the BBP, an algorithmic approach to 
reconstruct synaptic connections (described in detail in a com- 
panion paper [Reimann et al., 2015]). Whenever an axonal 
profile comes into close encounter with a dendrite, a potential 
synapse is marked. Such a potential synapse is the child of 
chance, based on the probabilities that an axonal bouton finds 
itself near a dendrite. This is known as Peter’s rule, stipulating 
that axons seek out dendrites randomly— the higher the bouton 
density and the larger the dendrite, the bigger the connection 
probability. By this measure, most neurons connect to most 
other neurons (there are, after all, close to one billion synapses 
in each mm^ of gray matter). Peters’ rule places a hard 
constraint on the connectivity, in the sense that both axons 
and dendrites need to be simultaneously present for a synapse 
to form. 

Reimann and colleagues (Reimann et al., 2015) know from 
light- and electron-microscopic data derived by a great many 
labs, including their own, that any one functional connection 
among two neurons is implemented by many anatomical synap- 
ses. Put differently, if two cortical neurons are connected, they 
form anywhere between a handful and perhaps 20 anatomical 
synapses; only rarely does a single synapse connect two neu- 
rons. The observation that simple axonal-dendritic proximity— 
which would predict a majority of singleton connections— does 
not explain connectivity highlights the limit of Peters’ rule. It 
also points to the existence of learning rules that prune and 
grow synaptic connections. 

The BBP captures such observed anatomical and morpholog- 
ical constraints in a probabilistic connectomics algorithm that 
prunes the vast number of spatial appositions, leaving a much 
smaller number of actual anatomical synapses: 638 million 
potential synapses yield 37 million actual synapses that form 
8.1 million connections, with 3.6 excitatory and 13.9 inhibitory 
synapses per connection (all values given here are averages; 
the paper gives statistical distributions). These data are com- 
patible with a recent electron-microscopic reconstruction of a 
sliver of cortical tissue (Kasthuri et al., 2015). 

The physiology of these 37 million synapses is based on 
extrapolated data from paired-cell recordings, including their 
all-important short-term facilitating or depressing dynamic 
behavior. 207 types of neurons could, in principle, be connected 
using 207 x 207 = 42,849 different types of synapses. Unfortu- 
nately, for the vast majority of synaptic types, no experimental 
recordings are available, and reasonable inter- and extrapola- 
tions and assumptions must come to the rescue. 

Booting up the Virtual Slice 

With synaptic input in place, fortified by somatic depolarization 
of cells to facilitate their firing (mimicking arousing neuromodula- 
tor substances active under in vivo conditions) and some spon- 
taneous synaptic release, the transmembrane potential of each 
cell evolves according to the nonlinear cable equation. This is 
a partial differential equation that describes how the membrane 
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potential evolves in thin and elongated neuronal processes 
whose membrane properties are described by capacitive and 
conductive processes (Koch 1999). The numerical simulation 
uses a parallel version of the NEURON software that has been 
instrumental to the field (Carnevale and Hines 2006), running 
on an IBM Blue Gene/Q supercomputer at the Swiss National 
Supercomputing Center (CSCS) in Lugano. 

Powering up this digital slice yields reasonable-looking firing 
behavior and slow oscillatory bursts at 1 Hz. The simulation 
goes through some finger exercises, demonstrating its neuro- 
biological authenticity. The overall network activity is very sen- 
sitive to the extracellular calcium concentration [Ca^^o and the 
level of somatic depolarization of all neurons. [Ca^^o> varying 
between 1 mM under in vivo and 2 mM under in vitro con- 
ditions affects the probability of synaptic release. In effect, 
[Ca^^o controls the network’s susceptibility to synaptic pertur- 
bation, while the depolarization level controls the spontaneous 
firing of the neurons. Manipulating these variables flips the 
network between two qualitatively different dynamic regimes. 
Under in vitro conditions ([Ca^^o = 2 mM and less somatic 
depolarization), the network is highly responsive to perturba- 
tions, producing stereotypical and synchronized responses, 
such as spreading waves (Figure 14 in Markram et al., 2015), 
while under in vivo conditions ([Ca^^o closer to 1 mM and 
greater somatic depolarization), responses are graded and 
asynchronous. This transition is quite sharp and is mediated 
by differential changes in the excitatory to inhibitory synaptic 
balance. 

Given the very large number of approximations and extrapola- 
tions that underlie the BBP model, the fact that neurons do not 
erupt into either a paroxysm of epileptic discharge nor descend 
into a coma-like state of electrical silence but act, to a first order, 
as neurons in slices do, is a remarkable achievement. This is an 
important initial success in our Imitation Game, for the very 
reason that it allows the game to continue. Our investigator 
can begin asking tougher questions (of course, it also begs the 
question of whether all of these details were necessary in the first 
place, to which we will return later on). 

Pushing further, the BBP team highlights aspects of in vivo ac- 
tivity that are recapitulated by the model — in particular, the 
extent to which groups of neurons are active in a correlated 
manner (for instance, when spikes fire near simultaneously), 
have no particular temporal relationship, or are anti-correlated. 
Indeed, the digital simulacrum exhibits the same correlational 
structure as predicted on theoretical grounds and observed in 
rodent cortex by Renart et al. (2010). In particular, activities in 
excitatory and inhibitory neurons track each other, generating 
negative correlations in synaptic structure that cancel strong 
shared input. Similarly, the simulation shows repeating triplet 
structures in the synchronous regime that do not appear in the 
asynchronous regime of low calcium. Finally, a recent experi- 
mental study of Okun et al. (2015) establishes the existence of 
many highly correlated “chorister” neurons and a few isolated 
“soloists.” The former are neurons whose activity is tightly corre- 
lated with the average activity of the network that they are 
embedded in, while the latter appear to actively fire indepen- 
dently of the rest of the network. The simulation recapitulates 
this observation as well. 



These demonstrations are comforting validations of the 
model in terms of our Imitation Game. Of course, the transition 
observed by the BBP team is clearly describable by simple 
models (there is a long history, with notable contributions 
from Wilson and Cowan (1 972) and Amah (1 975) of applied math- 
ematics devoted to the solution space of so-called “neural field 
theories,” demonstrating complex behavior as neural character- 
istics are varied (see Bressloff, 2014), including spreading waves 
(Beurle, 1956). 

The most interesting observation of the present study is the 
sensitivity of network behavior to extracellular calcium concen- 
tration and that propagating waves may be an artifact of too 
high [Ca^^o levels (which would not, however, explain the waves 
seen in Xu et al., 2007). Yet the enormous effort of the BBP to 
carefully reconstruct details of the cortical circuit are probably 
irrelevant to this prediction, as the dependency on extracellular 
calcium appears to enter through an empirical model of synaptic 
neurotransmitter release and not through any network dynamics. 
In the same vein, the mechanism of Renart et al. (2010) was 
obtained in a model of conductance-based integrate-and-fire 
neurons. The additional detail added by the BBP does not affect 
the mechanism. 

Uncovering the Limits of the Model 

In the Imitation Game, there is one cunning strategy that is 
guaranteed to lead to the denouement of the impostor. 
Simply probe further and further into the micro-structure: while 
it is not clear whether there is any “ultimate” level of reality- 
elementary particles such as the Higgs boson, fields, or super- 
strings (depending on the energy invested in probing the 
micro-structure) —simulations abruptly bottom out. To wit, the 
investigator could ask her lab members to resolve fine details 
of the macroscopic Na^, K^, Ca^"^, and Cr membrane currents. 
If appropriate electrodes and amplifiers are used, the recordings 
will uncover discrete, microscopic, and stochastic ionic chan- 
nels, while the current BBP model stops with the continuous 
and deterministic Hodgkin-Huxley currents. Bingo! 

And therein lies an important lesson. If the real and the syn- 
thetic can’t be distinguished at the level of firing rate activity 
(even though it is uncontroversial that spiking is caused by the 
concerted action of tens of thousands of ionic channel proteins), 
the molecular level of granularity would appear to be irrelevant 
to explain electrical activity. Teasing out which mechanisms 
contribute to any specific phenomena is essential to what is 
meant by understanding. 

Markram et al. claim that their results point to the minimal 
datasets required to model cortex. However, we are not aware 
of any rigorous argument in the present triptych of manuscripts 
(Markram et al., 2015; Ramaswamy et al., 2015; Reimann 
et al., 2015), specifying the relevant level of granularity. For 
instance, are active dendrites, such as those of the tall, layer 5 
pyramidal cells (Hay et al., 2011), essential? Could they be 
removed without any noticeable effect? Why not replace the 
continuous, macroscopic, and deterministic HH equations with 
stochastic Markov models of thousands of tiny channel conduc- 
tances? Indeed, why not consider quantum mechanical levels 
of descriptions? Presumably, the latter two avenues have not 
been chosen because of their computational burden and the 
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intuition that they are unlikely to be relevant (Strassberg and De- 
Felice, 1993; Koch and Hepp, 2006). The Imitation Game offers a 
principled way of addressing these important questions: only 
add a mechanism if its impact on a specific set of measurables 
can be assessed by a trained observer. 

Consider the problem of numerical weather prediction and 
climate modeling, tasks whose physico-chemical and computa- 
tional complexity is comparable to whole-brain modeling. 
Planet-wide simulations that cover timescales from hours to 
decades require a deep understanding of how physical systems 
interact across multiple scales and careful choices about the 
scale at which different phenomena are modeled. This has led 
to an impressive increase in predictive power since 1950, 
when the first such computer calculations were performed 
(Bauer et al., 201 5). Of course, a key difference between weather 
prediction and whole-brain simulation is that the former has a 
very specific and quantifiable scientific question (to wit: “is it 
going to rain tomorrow?”). The BBP has created an impressive 
initial scaffold that will facilitate asking these kinds of questions 
for brains. 

What is also noteworthy is that, with the publication of this 
paper, the BBP has publicly released a treasure trove of morpho- 
logical, electrical, and connectional data collected over the last 
years by members of the Markram lab that was used in this 
model. For more details, see the companion manuscript (Ram- 
aswamy et al., 2015). The experience of the Allen Institute 
(www.brain-map.org) has demonstrated the impact of releasing 
all relevant data and models soon after they pass an internal 
quality control stage (usually several years prior to publication 
of any associated results). The release of the massive database 
of the BBP is thus a very positive development. 

It is clear that physically detailed, whole-brain simulations 
enable neuroscientists to answer specific questions that are 
difficult to address experimentally. The BBP has provided a 
powerful tool in this regard. Such simulations may be essential 
to develop therapeutics for brain-based diseases. Whole-brain 
models can also be said to encapsulate the present state of 
knowledge, as per the Imitation Game. However, any simula- 
tion-gained knowledge must be supported and complemented 
by theories that, by isolating the relevant variables, enable us 
to deeply understand the most organized and highly excitable 
piece of matter in the known universe. 
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Modern humans overlapped in time and space with other hominins, such as Neanderthals and 
Denisovans, and limited amounts of hybridization occurred. Here, we review recent work that 
has identified archaic hominin sequence that survives in modern human genomes and what these 
genomic excavations reveal about human evolutionary history. 



Introduction 

As recently as 30,000 years ago anatomically modern humans 
coexisted with other hominins, such as Neanderthals and Deni- 
sovans (Figure 1A). Advances in ancient DNA extraction and 
sequencing technologies have enabled high-quality Neanderthal 
(Prufer et al., 2014) and Denisovan (Meyer et al., 2012) reference 
genomes to be developed, and in doing so provided the tools to 
definitively address an often heatedly debated topic in human 
evolution: did our ancestors interbreed with archaic humans? 
Analyses leveraging these archaic whole-genome sequences 
showed that hybridization did indeed happen between our an- 
cestors and Neanderthals (Green et al., 2010; Yang et al., 
201 2; Sankararaman et al., 201 2; Prufer et al., 201 4) and Deniso- 
vans (Reich et al., 2010; Reich et al., 2011). In particular, all 
non-Africans inherited approximately 2% of their genome from 
Neanderthal ancestors and individuals of Melanesian ancestry 
inherited approximately 4%-6% of their genome from Deniso- 
van ancestors. Note, the number of hybridization events 
compatible with the observed levels of archaic ancestry is 
currently unknown and difficult to estimate precisely as it re- 
quires tenuous assumptions about a range of parameters such 
as historical population sizes and fitness of hybrid offspring. 

More recently, the focus of admixture studies has shifted from 
testing the hypothesis that archaic admixture occurred to identi- 
fying the specific sequences that were inherited from archaic an- 
cestors. The two largest studies to date searched for surviving 
Neanderthal sequences in whole-genome sequence data from 
individuals of European and East Asian ancestry (Sankararaman 
et al., 2014; Vernot and Akey, 2014). While these studies used 
different methodological approaches (reviewed by Racimo 
et al., 2015), the key population genetics signatures that allow 
archaic sequences to be identified are unusually high genetic 
divergence from the human reference genome and high 
sequence similarity to the Neanderthal reference genome over 
long (~30-100 kb) stretches of DNA. In total, Vernot and Akey 
(Vernot and Akey, 2014) identified 600 Mb of introgressed Nean- 
derthal sequence and Sankararaman et al. (Sankararaman et al., 
2014) reported 1.1 Gb of introgressed sequence, representing 
20% and 35%, respectively, of the Neanderthal genome. Thus, 
although any given individual carries only a small amount of 
Neanderthal sequence, cumulatively a substantial amount of 



the Neanderthal genome persists in the modern human popula- 
tion and can be recovered by aggregating across hundreds of 
individuals. 

Beyond characterizing the extent of archaic admixture that 
occurred in East Asians and Europeans, the identification and 
analysis of surviving Neanderthal sequence in contemporary 
individuals has revealed a number of fascinating insights about 
human evolutionary history. Here, we briefly summarize what 
has been gleaned from studies of archaic hominin admixture, 
identify existing gaps in knowledge, and suggest fruitful areas 
of future inquiry. 

Widespread Purging of Neanderthal Sequence in 
Modern Humans 

A striking feature of Neanderthal admixture maps is that the 
amount of surviving Neanderthal sequence varies considerably 
across the genome (Figure 1 B; Sankararaman et al., 2014, Ver- 
not and Akey 2014). Such marked heterogeneity suggests that 
there may have been fitness consequences to hybridization 
and some Neanderthal sequences were deleterious in the back- 
ground of modern human genomes. Consistent with this hypoth- 
esis, the frequency of Neanderthal alleles is negatively correlated 
with inferred functional importance (Sankararaman et al., 2014), 
and the odds of observing Neanderthal sequence in a region is 
inversely proportional to the amount of sequence divergence 
between the modern human and Neanderthal genome (Vernot 
and Akey 2014). Both of these signatures are expected if intro- 
gressed Neanderthal sequence experienced widespread purify- 
ing selection in modern humans. Moreover, the X chromosome is 
significantly depleted of Neanderthal sequence, with an approx- 
imate 5-fold reduction in Neanderthal ancestry compared to 
autosomes (Sankararaman et al., 2014), suggesting reduced 
fitness in male hybrid offspring (and perhaps male hybrid 
sterility), a phenomenon observed in many other species (Orr, 
1997). Testis-specific genes are significantly enriched in regions 
depleted of Neanderthal sequence (Sankararaman et al., 2014), 
further supporting the inference of reduced fitness in male hybrid 
offspring. It is important to note that the amount of divergence 
between modern humans and Neanderthals is lower on the X 
chromosome, which may reduce power to detect introgressed 
sequence. Nonetheless, although additional work remains to 
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Figure 1. Insights Gleaned from Studying 
Archaic Hominin Admixture 

(A) Schematic illustration of the out of Africa 
dispersal of modern humans (red). Blue and green 
shading roughly indicates the regions inhabited 
by Neanderthals and Denisovans, respectively. 
Although fossil evidence of Denisovans has been 
found only at Denisova Cave, the observed pattern 
of Denisovan admixture suggests they had a wide 
range that extended southeast. 

(B) Distribution of surviving Neanderthal sequence 
(blue bars) in Europeans and East Asians along 
chromosome 9 (data from Vernot and Akey 2014). 
The stylized plots below show the frequency of 
Neanderthal haplotypes across two genomic re- 
gions. An example of putative adaptive introgres- 
sion (where the Neanderthal haplotype is found at 
an unusually high frequency) spanning the BNC2 
gene is shown on the left and a ^4 Mb desert of 
Neanderthal sequence is shown on the right. 

(C) Demographic model inferred from patterns 
of archaic admixture, which includes at least two 
pulses of Neanderthal admixture and one pulse of 
Denisovan admixture into individuals of Melane- 
sian ancestry. 



they likely had already acquired adaptive 
mutations that were beneficial to the 
unique environmental pressures that 
they evolved in. It is easy to imagine that 
this variation would have experienced 



fully evaluate the inference of male hybrid sterility, it is consistent 
with the other observed signatures of widespread purifying se- 
lection on Neanderthal sequences (Sankararaman et al., 2014). 

In addition to these global genomic signatures of purifying se- 
lection acting on introgressed Neanderthal sequence, maps of 
Neanderthal admixture also reveal a number of specific genomic 
regions that exhibit especially strong depletion of Neanderthal 
haplotypes (Figure IB; Sankararaman et al., 2014, Vernot 
and Akey 2014). These “deserts” of Neanderthal sequence are 
particularly interesting because they may delineate regions that 
contribute to uniquely modern human phenotypes. For example, 
the largest Neanderthal desert on the autosomes spans a ~1 5 Mb 
region on chromosome 7 (Vernot and Akey 201 4) and contains the 
gene FOXP2, a developmentally important transcription factor 
that has previously been implicated in speech and language 
(Enard et al., 2003). Flowever, additional work is necessary to 
rule out other potential explanations for the origins of Neanderthal 
deserts, such as inversions (which suppress recombination) or 
unrecognized extreme demographic histories (such as periods 
of very small population size in modern humans at the time of 
admixture). One promising approach in interpreting the biological 
significance of Neanderthal deserts is the construction of archaic 
admixture maps for other hominin groups, such as Denisovans. 
Deserts that replicate across multiple archaic hominins may be 
of particular biological interest and help refine the genomic sub- 
strates of uniquely modern human traits. 

Adaptive Introgression 

Since archaic hominins lived in Europe and Asia for at least 
200,000 years before modern humans migrated to these regions. 



strong positive selection in modern humans who acquired it 
through hybridization (Racimo et al., 2015). Whole-genome 
Neanderthal admixture mapping has identified several Neander- 
thal haplotypes that persist at higher frequencies than can be ex- 
plained by genetic drift (Sankararaman et al., 2014, Vernot and 
Akey 2014; Racimo et al., 2015). Strikingly, both Sankararaman 
et al. (2014) and Vernot and Akey (2014) identified three genes 
in the small set of putative adaptively introgressed loci that 
play important roles in hair and skin biology. Although the spe- 
cific selective pressure underlying the adaptive introgression at 
these loci is unknown, it is interesting to note that one of these 
genes, BNC2 (Figure 1 B), has recently been shown to influence 
skin pigmentation levels in Europeans (Jacobs et al., 2013). 

Targeted studies of individual loci have also shown that con- 
tact with archaic hominins resulted in adaptive introgression. 
Perhaps the most dramatic example is that a haplotype of 
EPAS1 that was previously shown to confer adaptation to high 
altitude in Tibetans (likely by lowering hemoglobin concentration) 
was found to be inherited from a Denisovan-like ancestor 
(Huerta-Sanchez et al., 2014). Similarly, adaptive introgression 
of Neanderthal or Denisovan sequences has been hypothesized 
for HLA (Abi-Rached et al., 2011) and STAT2 (Mendez et al., 
2012), both of which play important roles in the immune 
response, which is a well-known substrate of selection. Unlike 
theEPAS7 haplotype, introgressed haplotypes in these genomic 
regions are present in multiple human populations. 

Thus, archaic admixture appears to have been more than a 
curious feature of human evolutionary history. Specifically, 
gene flow from archaic hominins provided important adaptive 
benefits to our ancestors as they dispersed out of Africa into 
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new environments. To date, the causal alleles underlying adap- 
tive introgression have not been identified. However, in contrast 
to recent selective sweeps, which often delimit large genomic 
regions (~500 kb; see Akey, 2009), observed adaptively intro- 
gressed haplotypes are typically an order of magnitude smaller 
(median size ~50 kb), which should facilitate fine-scale mapping 
and experimental studies of putative causal adaptive alleles. 

Archaic Admixture Informs Demographic Models of 
Human History 

Initial studies of Neanderthal admixture in geographically diverse 
populations suggested that all non-African populations had 
approximately the same levels of Neanderthal ancestry (Green 
et al., 201 0). Thus, the most parsimonious model to explain these 
data was that hybridization occurred once in the ancestral pop- 
ulation to all present day non-Africans, likely in the Levant shortly 
after the dispersal of modern humans out of Africa (Sankarara- 
man et al., 2012). The identification of specific surviving Nean- 
derthal sequence, however, allowed more refined estimates of 
Neanderthal ancestry in different populations and, unexpect- 
edly, East Asian individuals were found to have on average 
20% more Neanderthal sequence than European individuals 
(Wall et al., 2013, Sankararaman et al., 2014, Vernot and Akey 
2014). 

The observation of higher Neanderthal ancestry in East Asians 
prompted reconsideration of the single-pulse model, and new 
models suggest at least two distinct admixture events— an initial 
pulse of admixture into the common ancestor of all present day 
non-African populations and an additional pulse of admixture 
into the ancestors of East Asians after their divergence from 
European populations (Figure 1C; Vernot and Akey 2014). An 
alternative explanation that has been proposed is a single pulse 
of admixture followed by less efficient purging of deleterious 
Neanderthal sequence in East Asians (Sankararaman et al., 
2014), given their smaller effective population sizes (Keinan 
et al., 2007). More recent analyses (Vernot and Akey, 2015, 
Kim and Lohmueller, 2015) using distinct statistical approaches 
found that the two-pulse model better explained the data. Even 
more directly, the recent observation of Neanderthal admixture 
in a ~40,000 year old early modern human individual from 
Romania (Fu et al., 2015) dramatically shows that admixture 
occurred in multiple places and times. It is important to stress 
that even a two-pulse model of admixture with Neanderthals is 
likely a simplification, and as shown by Vernot and Akey (2015) 
additional admixture scenarios are also compatible with 
currently available data. 

Currently, much less is known about admixture dynamics be- 
tween modern humans and Denisovans, whose genetic contri- 
bution has been definitively found only in Melanesians (Reich 
et al., 2010; Reich et al., 2011), with potentially very low levels 
of Denisovan ancestry in East Asians (Skoglund and Jakobsson, 
2011). Papua New Guineans carry the highest level of Denisovan 
ancestry (4%-6% of the genome) among all populations studied 
to date, with other Southeast Asian island populations generally 
showing Denisovan ancestry levels consistent with indirect 
acquisition of introgressed sequence through recent gene flow 
from populations related to Papua New Guineans (Reich et al., 
2011). These findings, combined with the observed frequencies 



of Neanderthal ancestry, suggest that modern humans peopled 
Asia in at least two distinct waves, with one wave taking a south- 
ern route and acquiring Denisovan ancestry and a separate wave 
responsible for colonization of East Asia and Indonesia. The fos- 
sil evidence of Denisovans is limited to a single specimen discov- 
ered much farther north, in Siberia, but the geographic pattern 
of Denisovan ancestry strongly suggests that Denisovans had 
a range that extended into Southeast Asia (Figure 1A). These 
inferences demonstrate how studies of archaic admixture can 
be incorporated with other information to refine demographic 
models of modern human history. 

Future Challenges and Opportunities 

Recent studies of archaic hominin admixture in modern humans 
have provided fascinating new insights into human evolutionary 
history and the role that hybridization has played in shaping 
extant patterns of human genetic variation. Despite this progress, 
considerable gaps in knowledge remain. Perhaps most obvi- 
ously, it will be of great interest to comprehensively identify Nean- 
derthal and Denisovan sequences in geographically diverse 
populations. Such data will be critical for refining admixture 
models and may lead to the identification of additional examples 
of adaptive introgression. Furthermore, creating maps of surviv- 
ing archaic sequences across globally diverse populations will 
enable introgression deserts to be identified with more precision 
and confidence, which ultimately may allow the genetic sub- 
strates of uniquely modern human traits to be discovered. 

Another key opportunity is to systematically explore the 
phenotypic legacy of archaic hominin admixture. Initial analyses 
have already shown that genetic variation inherited from Nean- 
derthals contributes to a wide range of human diseases (Sankar- 
araman et al., 201 4; Williams et al., 201 4). For example, a type 2 
diabetes risk haplotype that is common in Latin Americans and 
East Asians but rare in Europeans has been found to originate 
in Neanderthals (Williams et al., 2014). Moreover, the resources 
to perform phenome-wide association studies by integrating 
electronic medical records with variants that tag archaic haplo- 
types are becoming increasingly available (Denny et al., 2013). 
Such analyses will allow comprehensive insights into how 
hybridization has contributed to human phenotypic diversity, 
and whether particular diseases have a disproportionately 
high burden attributable to risk variants inherited from archaic 
ancestors. 

Furthermore, new methodological tools need to be developed 
to fully exploit the information contained in catalogs of intro- 
gressed sequence identified in geographically diverse popula- 
tions. In contrast to sequencing ancient DNA from a single 
fossil, the recovery of surviving archaic sequences in modern hu- 
mans provides population level data in that the sequences 
recovered come from multiple archaic ancestors. Thus, in the- 
ory, more powerful population genetics inferences are possible 
compared to cases where genetic information exists for a single 
individual. Methodological innovations will bring us closer to 
answering questions such as the size and genetic structure of 
archaic populations, the number of archaic ancestors that 
contributed to surviving archaic sequence in modern humans, 
dates of introgression, and whether sex-biased admixture 
occurred. 
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Finally, we anticipate that future studies of archaic admixture 
will increasingly focus on identifying and characterizing DNA se- 
quences in the genomes of modern humans that were inherited 
from hominins other than Neanderthals or Denisovans. The 
study of African populations may be particularly fruitful, as 
many hominin lineages existed in Africa. Indeed, preliminary ge- 
netic evidence of introgression from an unknown hominin 
ancestor has been observed in several African populations 
(Hammer et al., 2011; Lachance et al., 2012). The absence of 
reference genome sequences will complicate the search for 
gene flow from unknown hominin lineages, although population 
genetics tools do exist for identifying putatively introgressed 
sequence without explicitly using an archaic reference genome 
(Hammer et al., 2011, Vernot and Akey 2014). Despite these 
challenges, excavating extinct hominin lineages that persist in 
the DNA of contemporary individuals may be the only way to 
identify such groups where no ancient DNA or fossil data exists. 

In summary, although the amount of surviving archaic 
sequence in any individual modern human genome is small, 
archaic admixture has had a profound lasting impact, not only 
in shaping who we are today but also in informing us about our 
history. Ironically, studying the scattered remnants of surviving 
archaic DNA that persist in modern humans may lead to a deeper 
understanding of why we survived to expand and thrive while our 
close relatives did not. 
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Molecular codes, like postal zip codes, are generally considered a robust way to ensure the spec- 
ificity of neuronal target selection. However, a code capable of unambiguously generating complex 
neural circuits is difficult to conceive. Here, we re-examine the notion of molecular codes in the light 
of developmental algorithms. We explore how molecules and mechanisms that have been consid- 
ered part of a code may alternatively implement simple pattern formation rules sufficient to ensure 
wiring specificity in neural circuits. This analysis delineates a pattern-based framework for circuit 
construction that may contribute to our understanding of brain wiring. 



Introduction 

The brain, as we neuroscientists like to say, is really complex. A 
good deal of our efforts are therefore dedicated to figuring out 
just how this apparent complexity is generated: where does 
the information to build a brain come from, and how is such infor- 
mation turned into synapse-specific wiring? We call this the 
“brain wiring problem.” 

We know how it is not done. For example, there cannot be a 
genetic blueprint that describes every synaptic connection in 
the way a blueprint of a microchip or electrical wiring diagram 
does (Figure 1A). Why is this? A blueprint can clearly be envi- 
sioned that precisely matches any given neural circuit, allowing 
for it to be reproducibly built. However, the complexity of such 
a blueprint exactly equals the complexity of the actual wiring di- 
agram. In other words, this solution generates a new problem 
that is just as difficult: How is the blueprint generated? It is like 
answering the question of how life evolved on earth by arguing 
that it may well have arrived here from a different planet. This 
is indeed a solution, but an unsatisfying one because it leaves 
the equally difficult and interesting question of how life evolved 
on some other planet unresolved. 

In this Perspective, we will discuss the concept of develop- 
mental algorithms as a solution to the problem of how only a 
few thousand genes can produce complexity in brain wiring. 
We will begin by defining some of the key terminology. Next, 
we consider the evidence for deterministic molecular codes 
that could define connections, akin to blueprints defining electri- 
cal circuits. We will then discuss examples of molecules that 
were once envisioned to be part of a code but were subsequently 
revealed to execute developmental rules that integrate stochas- 
tic processes in the development of neural circuits. These exam- 
ples highlight how complicated structures can be generated 
through simple pattern formation rules rather than elaborately 
defined addresses. In the last section, we will discuss the differ- 
ence between an understanding of brain wiring in terms of devel- 



opmental rules versus the more common focus on mechanisms 
of individual molecules that execute those rules. 

Genetically Encoded Stochastic Invariability? 

In order to understand to what extent and how genes can encode 
brain wiring, we need to first establish a few simple definitions. 
First, we define developmental outcomes as “genetically en- 
coded” if environmental input does not contribute any instructive 
information to that outcome. A common assumption is that a 
genetically encoded process is invariable, but this would be a 
misunderstanding. For example, grafting two branches from 
the same apple tree on two different root stocks will generate 
two genetically identical trees with non-identical branching pat- 
terns— just as no two dendritic arbors of genetically identical 
Purkinje cells ever look exactly the same (Figure 1 B). A simple al- 
gorithm can generate an invariable branching pattern (Prusinkie- 
wicz and Lindenmayer, 1990) (Figure 1C); however, introduction 
of random inaccuracies can create variability in such a system 
(Figure ID). Small environmental differences undoubtedly play 
a role in generating the differences observed in branching pat- 
terns between genetically identical specimens. However, this 
type of differential environmental input does not contain informa- 
tion for any specific branching pattern; rather, the developmental 
program ensures robustness of functionally important aspects 
and allows for variability otherwise. By our definition, such a pro- 
cess is genetically encoded, because the environmental input 
did not contribute any instructive information to generate that 
outcome: in both the apple tree and the Purkinje cell, the precise 
position of branches is irrelevant as long as the final branched 
structure covers a certain region in a specific manner. Hence, 
genetically encoded developmental algorithms can produce 
similar but non-identical structures in the brain. 

Small environmental differences are a random variable and 
therefore define a stochastic process that can lead to variability 
in the outcome. Conversely, a system is defined as deterministic 
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Figure 1. From Deterministic Blueprints to Stochastically Branched Structures in Biology 

(A) Schematic of a hypothetical electrical blueprint with deterministic definition of all contacts. 

(B) Schematic drawing of a Purkinje cell after a well-known drawing from Ramon y Cajal. The precise branching pattern, number, and placement of dendritic 
endings is variable. 

(C) A simple computer-generated branched structure. The deterministic definition of all branches is generated by a few lines of code (Lindenmayer system) using 
L-Studio 4.2.13 by Przemyslaw Prusinkiewicz and Radek Karwowski. 

(D) The same branched structure as in C, but with stochastic pattern changes. 



if no randomness is involved in the development of future states 
of the system. However, invariability in the outcome does not 
require a deterministic system. As we shall see throughout this 
Perspective, even when the outcomes appear invariable, the 
processes that generate them are often stochastic. All geneti- 
cally encoded developmental signaling events contain stochas- 
tic processes. Notch signaling provides an excellent example. 
This pathway encodes a molecular mechanism that breaks 
the symmetry in cell differentiation by ensuring that only one 
daughter cell becomes cell type A, the other type B. The 
outcome of exactly one cell type A and B is invariable, but it is 
crucial to consider that the fate from the perspective of the indi- 
vidual cell is in fact stochastic, because it is impossible to predict 
which of the two cells will be A and which will be B. We call this a 
genetically encoded stochastic process with an invariable, or 
highly stereotyped, outcome. Thus, be it during the development 
of the vertebrate heart (de la Pompa and Epstein, 201 2) or the fly 
eye (Carthew, 2007), stochastic processes are parts of geneti- 
cally encoded developmental programs that lead to highly ste- 
reotyped and robust outcomes. 

A particularly insightful example of a developmental process 
critical for brain wiring is the idea that “cells that fire together, 
wire together” in the mammalian visual system (Shatz, 1996). 
This process is based on spontaneous activity waves that Shatz 
and colleagues first saw sweeping over the ferret retina even 
before these cells are capable of receiving environmental input. 
While the activity waves are stochastic, they result in stereo- 
typed layer formation in the lateral geniculate nucleus. Some 
variability occurs (e.g., the precise size of the layers), but no envi- 
ronmental input contributes instructive information to generate 
the developmental outcome. Hence, we consider this process 
a genetically encoded stochastic process with a highly stereo- 
typed outcome. As an aside, it follows from these considerations 
that an “activity-dependent” process can be part of a genetically 
encoded program. In this Perspective, we only explore such 



genetically encoded processes, while the important roles of 
environmental input in activity-dependent synaptic fine tuning 
are reviewed elsewhere (Ganguly and Poo, 2013; West and 
Greenberg, 2011). Obviously, genes do not encode stochastic 
spontaneous neuronal activity just as they do not encode sto- 
chastic branching patterns. Instead, in both cases, gene activity 
defines the developmental algorithms that lead to such cellular 
behaviors. 

An important lesson from these examples is the use of sto- 
chastic processes as an integral and necessary part of develop- 
mental algorithms, which contrasts with the view of noise as 
something that development just has to cope with or minimize 
in order to create a robust outcome (Clarke, 2012; Mele et al., 
2015). Recent work in several fields has highlighted the impor- 
tance of understanding both stochastic processes and hetero- 
geneity of cellular behavior. These fundamental features of all 
biological systems are lost when we focus on studying averages 
(Altschuler and Wu, 2010; Losick and Desplan, 2008). How 
developmental algorithms generate apparent complexity in brain 
wiring therefore requires insights into the developmental process 
that are often non-intuitive and quantitative. 

Molecular Codes and the “Complexity Reduction Model” 

What fundamental solutions to the brain wiring problem do mo- 
lecular codes offer? Genes encode molecules, and molecules 
can theoretically provide combinatorial codes of almost any 
complexity. The success story of molecular biology and gene 
discovery provides us with ample examples. The obvious candi- 
dates for establishing molecular codes in intercellular, synapse- 
specific interactions are secreted and membrane-associated 
“guidance cues,” their receptors, and cell adhesion molecules 
(Kolodkin and Tessier-Lavigne, 2011). These cues include mole- 
cules that belong to “canonical” guidance cue families (Netrins, 
Slits, Semphorins, and Ephrins) as well as cell adhesion mole- 
cules of the immunoglobulin or cadherin superfamilies. The 
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canonical secreted and membrane-associated guidance cues 
function at long range or short range to mediate attractive or 
repulsive signals; cell adhesion molecules may function through 
direct contact-mediated hemophilic or heterophilc interactions. 
These molecules and mechanisms are reviewed in detail else- 
where (Kolodkin and Tessier-Lavigne, 2011; Raper and Mason, 
2010; Yogev and Shen, 2014). 

Many of these genes can produce mRNAs resulting from 
different splice variants or utilization of multiple promoters, 
further increasing the numerical potential for different combina- 
torial codes. The most impressive examples are invertebrate 
Dscams and vertebrate Protocadherins, since members of 
each family are present in thousands of different splice variants 
that are required for wiring specificity (Lefebvre et al., 2012; 
Schmucker et al., 2000; Zipursky and Sanes, 2010). In addition, 
many of the key growth factor and embryonic patterning path- 
ways such as Wnt, FGF, EGF, and BMP have been found to be 
required for brain wiring (Charron and Tessier-Lavigne, 2005; 
Srahna et al., 2006). Thus, a numerically large array of molecular 
combinations is, in principle, available for neurons to both 
display and respond to. 

A common feature of guidance cues and receptors is their cell- 
specific expression and spatiotemporally dynamic localization 
(Chan et al., 2011; Williamson et al., 2010; Zschatzsch et al., 
2014). This observation further supports their potential roles in 
establishing molecular codes; temporal coding using the same 
guidance receptor can generate a “code in time” to specify 
target areas (Petrovic and Hummel, 2008; Yogev and Shen, 
2014). Hence, a picture emerges in which different cells may ex- 
press distinct combinations of guidance cues and receptors; 
different combinations of these molecules can then be pre- 
sented at distinct places and at specific times during brain wiring 
to provide unique targeting and synapse formation signals. It is 
also often argued that early connectivity events take place 
in much less complicated wiring environments than the final 
pattern might indicate. The resulting model thus assumes a 
stepwise process successively restricting possible targeting 
choices such that the problem of choosing among thousands 
of options may never occur. We call this the “complexity reduc- 
tion model.” 

Numerous elegant mechanisms that contribute to com- 
plexity reduction have been put forth. For example, classic 
guidance cues can form gradients that help in the parallel tar- 
geting of many axons simultaneously (Kolodkin and Tessier- 
Lavigne, 2011; Yogev and Shen, 2014). Parallelized targeting 
creates repetition and thus redundancy in information encod- 
ing: a molecularly encoded blueprint for such a system only 
needs to specify one of the repetitive units. In addition, even 
a limited number of guidance cues may be consistent with 
the apparent complexity of brain wiring if specificity is achieved 
through regulation of guidance cues in space and time, 
through protein expression levels, or through the utilization of 
heteromultimeric receptor machinery. The complexity reduc- 
tion model is thereby proposed to provide a solution to the 
brain wiring problem: we may not yet have worked out all the 
details of when, where, and what kinds of combinatorial molec- 
ular codes occur, but there seems to be no fundamental prob- 
lem. Or is there? 



The Molecular Code and Its Discontents 

After establishing the idea of the complexity reduction model, we 
must ask how the underlying stepwise, spatiotemporal code is 
generated. How do specific combinations of guidance cues 
selectively and precisely get to be at the right time and place 
to function as meaningful synapse specification signals? Here 
again, we face the blueprint problem: the establishment of a 
deterministic, spatiotemporally precise molecular code would 
be a problem of comparable complexity to the wiring diagram 
that it is supposed to explain. 

The blueprint problem holds irrespective of whether the code 
is determined through numbers of molecular cues, temporal 
control of molecular cues, or differential expression levels of 
molecular cues; it is not obviously easier to control the precise 
proteins levels of one cue in space and time than to control the 
precise combination of several cues only in space (Chan et al., 
2011). The example of gradients highlights one way to create 
repetition of similar structures and thereby reduce the amount 
of information, or codes, needed. Complexity can thus theoreti- 
cally be reduced through the exact repetition of a precise and 
deterministic address code system. However, repetitive struc- 
tures in brain wiring do typically, and maybe without exception, 
allow for some level of variability, revealing an underlying sto- 
chastic process. This variability may be functionally irrelevant 
(like differences in precise branching patterns, comp. Figures 
1B-1D), but it can reveal the underlying developmental rules 
and mechanisms: molecular cues that provide approximate 
guidance for many axons in parallel do not specify a precise 
one-to-one address code. Hence, variability may not just be 
due to the repetition of a slightly imprecise address code but, 
rather, an inherent outcome of a stochastic process that neither 
requires nor generates an address code. 

The complexity reduction model allows for variability by mak- 
ing guidance cues less precise. But how does lack of precision 
ensure robustness of the developmental process? We have 
already argued that noise is not simply an artifact that biology 
has to “live with” and try to reduce but is an integral part 
of how a developmental algorithm functions in brain wiring. 
How do guidance molecules deal with or even utilize noise? 
Numerous influential studies on cell adhesion and guidance re- 
ceptor functions in different model systems provide us with 
ample examples for a new and surprising understanding of 
what these molecules do. For example, we now know that, in 
Drosophila, an individual neuron’s choice of one out of thou- 
sands of Dscami isoforms is indeed unpredictable (Miura 
et al., 2013). Interestingly, a wealth of groundbreaking work on 
Dscami has revealed a function for this non-deterministic iso- 
form choice that is quite different from a precise address 
code: both Dscam and Protocadherins in vertebrates serve a pri- 
mary role in mediating self-avoidance, which requires distinction 
between self and non-self cell surfaces (Lefebvre et al., 201 2; Zi- 
pursky and Sanes, 201 0). Similarly, cell-intrinsic stochastic recy- 
cling of EGFR in the growth cones of higher-order visual system 
neurons in Drosophila is required to form a highly stereotyped 
axonal branching pattern (Zschatzsch et al., 2014). Variability 
does not arise in these examples by making a molecular code 
less precise. Instead, variability is a necessary outcome of 
pattern formation processes following simple rules. Specifically, 
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Dscams and Protocadherins implement a simple pattern forma- 
tion rule without providing a “cue” for axon targeting or synapse 
specification (Kise and Schmucker, 2013; Zipursky and Sanes, 
2010). Large numbers of randomly chosen isoforms serve 
this function, similar to randomly coded remote garage door 
openers; as long as there are enough different isoforms, the like- 
lihood is sufficiently low that your neighbor’s system has the 
same random recognition code as you do. This beautiful role 
of the many isoforms of Dscams and Protocadherins is more 
akin to a pattern formation process than a molecular synapse 
specification code. 

The idea of “non-cue” functions of guidance molecules is 
further highlighted by the identification of cell-intrinsic functions. 
If no target is involved in an intrinsic axonal targeting or branching 
choice, then the implicated molecules cannot function as external 
cues or address code (Petrovic and Schmucker, 201 5). Examples 
include neuropilin-1 and semaphorin-3a in the mammalian olfac- 
tory system (Imai et al., 2009), the protocadherin Flamingo in the 
fly visual system (Schwabe et al., 2013), and, indeed, Dscami 
in mechanosensory neurons (He et al., 2014). Variability in the 
outcome arises not from an imprecision of a molecular code 
but because these processes are intrinsically noisy and because 
they generate patterns, rather than specify connections. 

A picture emerges of how developmental algorithms can 
“encode” synaptic specificity in neural circuits. The question 
thereby is: what exactly are these developmental rules, and 
how can they explain wiring specificity in the nervous system? 

The Simple Rules that Can 

Any comprehensive mechanistic explanation for brain wiring 
must explain three fundamental characteristics in a single con- 
ceptual framework: wiring specificity, wiring variability, and the 
inclusion of stochastic processes during development. A rules- 
based framework does just that. Variability in neuronal branching 
patterns and neural circuit architecture offers a first glimpse into 
underlying developmental rules that generate complicated con- 
nections. For example, a set of simple rules that includes 
spacing between axons and self-avoidance can explain both 
the robust and variable properties of axonal targeting patterns. 
A classic and fundamental simple pattern formation rule is lateral 
inhibition, which is often molecularly implemented through Notch 
signaling. Indeed, recent work in Drosophila shows that this clas- 
sical pattern formation principle in cell differentiation is actively 
employed by axons of postmitotic neurons during neural circuit 
assembly the Drosophila brain (Langen et al., 2013). Here, sto- 
chastic patterning leads to spacing with a defined average be- 
tween neighboring axons without specifying either the precise 
number or distance between them. This is an example of a sim- 
ple rule that does not by itself specify a synaptic connection but 
must function as part of a larger developmental algorithm. 

In another recent example, the seemingly complex wiring prin- 
ciple of the Drosophila visual system known as “neural super- 
position” was computationally modeled based on three simple 
rules (Langen et al., 2015). Rather than explaining the simulta- 
neous targeting of ~5,000 growth cones through mechanisms 
of attractive and repulsive molecular cues, the entire synaptic 
specification process can be explained through pattern forma- 
tion principles executed as a concatenation of simple genetically 



encoded subprograms (Hiesinger et al., 2006; Langen et al., 
2015). This example presents all features discussed here: wiring 
specificity (almost error-free connectivity), variability in the pre- 
cise spatial placement of pre- and postsynaptic elements (which 
may or may not be functionally relevant), and stochastic pro- 
cesses throughout brain development from neuronal cell fate 
choice to growth cone dynamics. Importantly, previously identi- 
fied roles of cell adhesion molecules in this process fit seam- 
lessly into this framework as implementors of pattern formation 
rules, rather than molecular address codes. 

An important aspect of brain wiring models based on simple 
pattern formation rules is the “sorting together” of presumptive 
synaptic partners, such that synapse formation is likely to occur 
between meaningful partners (Hiesinger et al., 2006). Indeed, it is 
a curious fact that the actual process of synapse formation ap- 
pears to be astonishingly non-specific across species. Neurons 
that innervate incorrect target regions generally will form synap- 
ses wherever they end up, however wrong the targets. In fact, if 
given no other choice, neurons readily form synapses with them- 
selves (so-called autapses) that are functionally indistinguish- 
able from synapses in the brain (Bekkers and Stevens, 1991). 
Hence, as an ingredient for synapse-specific brain wiring, non- 
specific synapse formation only makes sense in the context of 
a larger developmental algorithm. Specifically, promiscuous 
synapse formation can be guided by precise sorting of the right 
partners; this is observed, for example, in the mammalian olfac- 
tory system or the fly visual system (Hiesinger et al., 2006; Imai 
et al., 2009). Alternatively, activity-dependent pruning can func- 
tion in sculpting specific layers harboring specific synaptic con- 
nections, as is observed in the lateral geniculate nucleus of the 
vertebrate visual system (Shatz, 1996). In addition, develop- 
mental rules inconsistent with address codes are revealed by hu- 
man patient data. Axons from the left eye normally establish 
connection in the visual areas of the right brain hemisphere 
and vice versa. In 2009, physicians described the case of a child 
born without a right brain hemisphere who nonetheless devel- 
oped both left and right visual fields, in both the lateral geniculate 
nucleus and visual cortex, in the left hemisphere alone, resulting 
in near normal vision (Muckli et al., 2009). Taken together, a pic- 
ture emerges of how a purely genetically encoded develop- 
mental program can lead to synaptic specificity in a neural circuit 
through a concatenation of simple genetically encoded subpro- 
grams that employ stochastic processes and allow for flexibility, 
without the need for an address code. 

What simple rules can, together with unspecific synapse for- 
mation, create synapse specific wiring? Rules can pattern 
axonal and dendritic architectures that result in specific synaptic 
partners. For example, the development of the complicated den- 
dritic tree of a Purkinje cell (Figure 1 B) has been modeled and 
computer simulated using simple dynamic processes, including 
stochastic terminal branching and retraction triggered by den- 
dritic contact (Fujishima et al., 2012). Based on similar “on-con- 
tact” rules, we would like to propose a simple theoretical exper- 
iment. This experiment reveals how two simple pattern formation 
rules can determine synaptic specificity when assuming that 
synaptogenesis can occur between any presynaptic-postsyn- 
aptic contact. We envision a schematic brain structure (Figure 2) 
organized into layers and columns with dynamically extending 
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Figure 2. A Theoretical Experiment: Given 
Promiscuous Synaptogenesis at Any Con- 
tact Site, Two Simple Rules Are Sufficient 
to Generate Layer- and/or Column-Specific 
Synaptic Contacts 

(A) The rule “stop on pre-pre (or same cell type) 
contact” prevents overlap of neighboring, parallel 
presynaptic terminals, leading to tiling in columns. 
The rule “stop on pre-post (or other cell type) 
contact” prevents overlap within the column; the 
area where pre- and postsynaptic terminals meet 
defines a layer. Synapses can subsequently form 
“unspecifically” between any pre-post contact and 
are yet restricted to a specific column and layer. 

(B) The “pre-pre” rule is sufficient to maintain col- 
umns, but without a “pre-post” rule, overlap between 
different cell types lead to loss of a restricted layer. 

(C) The “pre-post” rule is sufficient to maintain 
layers, but without a “pre-pre” rule, overlap be- 
tween the same presynaptic cell types leads to 
loss of columnar restriction. 



presynaptic axon terminals (blue) and postsynaptic dendritic 
trees (black). Synapses (red/pink dots) will form promiscuously 
at pre-postsynaptic contact sites. In this minimal setup, the 
two simple rules “stop on presynaptic-presynaptic contact” 
and “stop on presynaptic-postsynaptic contact” can generate 
both layer and column specificity (Figure 2A). Specifically, stop 
on contact between presynaptic axon terminals and dendritic 
branches restricts the area of synapse formation to a specific 
layer; stop on contact between presynaptic axon terminals en- 
sures that neither invade each other’s column. Correspondingly, 
loss of the “stop on pre-post contact” rule reduces layer speci- 
ficity while preserving columns (Figure 2B); loss of the “stop on 
pre-pre contact” rule reduces column specificity while preser- 
ving layers (Figure 2C). A similar observation has recently been 
made for starburst amacrine cells, which form autapses when 
self-avoidance is perturbed and fail to form connections with 
other cells when their contact recognition is perturbed. (Kostadi- 
nov and Sanes, 2015). Flence, simple rules such as those 
executed by cell adhesion molecules previously interpreted as 
‘guidance cues’ or ‘recognition codes’ can easily generate vari- 
ety in columnar and layer organization and synaptic specificity. 
Local interaction rules can be iteratively applied across layers 
and columns and thereby provide complexity reduction. Finally, 
the same rule may be executed by different molecules in different 
systems, as seen for Dscams and protocadherins. 

The examples discussed here showcase how simple rules can 
explain the establishment of synaptic specificity in seemingly 
complex wiring diagrams to a significant extent. All of these ex- 
amples are based on iteration of simple rules and thereby create 
repetitive structures. We speculate that less-repetitive organiza- 
tion can result in brain regions where different developmental al- 
gorithms overlap. However, many examples remain where more 
deterministic solutions based on true guidance and/or matching 
cues appear to provide a satisfactory explanation. For example, 
specific laminae in the vertebrate retina are thought to be defined 
by distinct guidance receptors (Matsuoka et al., 2011). In the 
Drosophila olfactory system, the Teneurins ten-m and ten-a are 
proposed to function as homophilic “match-making” molecules 
(Hong et al., 201 2). Importantly, the idea of match-making poses 
some significant constraints on stochasticity, as both pre- and 



postsynaptic sides must have “matching” molecular partners 
and thus deterministic molecular recognition pairs. As a general 
mechanism for synaptic specification throughout circuit assem- 
bly, the idea of match-making is not easily reconciled with sto- 
chastic developmental processes. On the other hand, the 
match-making roles found for these two proteins occur very 
late in the developmental process to distinguish between 
few targeting choices prior to synapse formation; thereby, 
match-making can be understood as an elegant terminal sub- 
program of a larger developmental algorithm that leads to syn- 
aptic specificity. 

On the Relation between Developmental Rules and 
Molecular Mechanisms 

Much insight into neural circuit assembly has been gained from 
single-mutant gene studies that disrupt development. Such ex- 
periments are often designed to reveal molecular mechanisms, 
including attractive or repulsive interactions requiring cell sur- 
face receptors during neural circuit assembly. The perspective 
of developmental rules differs from this approach in the following 
way: a molecular mechanism executes a developmental rule but 
may not reveal the rule itself. In contrast, the developmental al- 
gorithm is defined as the set of rules that are sufficient to 
generate robust and precise wiring. Developmental rules can 
be formulated independent of the molecular mechanisms that 
execute them, as shown in the theoretical experiment above 
(Figure 2). More specifically, classical molecular mechanisms 
of guidance cues and receptors include homophilic and hetero- 
philic binding, both of which can implement either attractive or 
repulsive responses. But neither of these mechanisms by them- 
selves reveal their roles as guidance cues. For example, the mo- 
lecular mechanism of Dscami (and all its isoforms) is homophilic 
repulsion; however, this mechanism does not reveal its true 
role in implementing the simple pattern formation rule of self- 
avoidance. In contrast, the self-avoidance rule can be quantita- 
tively formulated and understood in the absence of molecular 
knowledge. 

How then can we identify developmental rules independent of 
the molecular mechanisms that execute them? Curiously, the 
most common route has remained molecular perturbation 



Cell 163, October 8, 2015 ©2015 Elsevier Inc. 289 





Cell 



experiments. The assumption in any molecular perturbation 
experiment is that taking a specific part out will reveal meaningful 
behavior of the system through the observed response and thus 
define the function of that specific part. The added hope is that 
the part whose role is revealed tells us something about the 
developmental rule that it executed. However, the system is 
likely to exhibit compensatory responses and secondary effects 
that may be difficult to interpret. Molecular perturbation experi- 
ments are therefore more likely to reveal underlying rules when 
the perturbation is carried out with high spatial and temporal res- 
olution. For example, analyses of N-cadherin and Flamingo in the 
fly visual system revealed general principles only through 
detailed investigation of individual mutant growth cones in rela- 
tion to identified wild-type or mutant neighbors (Schwabe 
et al., 2013, 2014). 

The characterization of developmental rules independent of 
the molecules that execute them is probably best achieved 
through live observation with or without spatiotemporally con- 
trolled perturbation. However, the live observation approach de- 
mands the ability to observe growth cone behavior and synapto- 
genesis during a relevant time period in a developing neural 
circuit without interference and with sufficient spatial and tempo- 
ral resolution. Where this has been achieved, live observation of 
individual neurons and their interactions over time yielded impor- 
tant insight into the temporal succession, and thus causal con- 
straints, for underlying brain wiring processes (Langen et al., 
201 5). In another example, recent live imaging of neuronal migra- 
tion in the zebrafish retina revealed unexpected cellular behavior 
leading to amacrine cell lamination (Chow et al., 2015). It will 
therefore be interesting to extend the live observation to other 
cell types in the vertebrate retina and the fly olfactory system 
to see whether cellular behaviors are best explained by code- 
based target selection mechanisms or simple pattern formation 
rules. 

However we attempt to break down the often quoted “daunt- 
ing complexity” of the brain, a complete solution to the brain 
wiring problem may bear more similarity to other developmental 
tissues than our intuition at first suggests. Rules like lateral inhi- 
bition, self-avoidance, and gradient-based patterning, as well as 
underlying mechanisms like heterophilic interaction, homophilic 
repulsion, and molecular gradients, are well-established facets 
of the development of all tissues. As such, brain wiring is likely 
to a large extent an example of particularly complicated develop- 
mental patterning rather than a special problem onto itself. While 
it may seem safest to the engineer in us to explain brain 
complexity with an equally complicated code, the history of 
developmental biology teaches us differently again and again: 
simple rules! 

ACKNOWLEDGMENTS 

We would like to thank our laboratories and many friends and colleagues for 
enduring discussions leading to this text. In particular, we want to thank for 
critical and very valuable discussions and comments on this article: Tom Clan- 
dinin, Alex Kolodkin, Dietmar Schmucker, Nevine Shalaby, Steven Altschuler, 
Lani Wu, and Joris de Wit. Work in the Hassan laboratory is funded by VIB, Uni- 
versity of Leuven, FWO (G.0503.12), belspo (Wi Brain Network), EMBO,and the 
European Commission’s Marie Sklodowska Curie programs. Work in the Hie- 
singer laboratory at the Freie Universitat Berlin is funded by the NeuroCure 



Cluster of Excellence, the NIH (RO1EY018884, RO1EY023333), the Muscular 
Dystrophy Association, the Deutsche Forschungsgemeinschaft (SFB958), and 
the FU Berlin. 

REFERENCES 

Altschuler, S.J., and Wu, L.F. (2010). Cellular heterogeneity: do differences 
make a difference? Cell 141 , 559-563. 

Bekkers, J.M., and Stevens, C.F. (1991). Excitatory and inhibitory autaptic cur- 
rents in isolated hippocampal neurons maintained in cell culture. Proc. Natl. 
Acad. Sci. USA 88, 7834-7838. 

Carthew, R.W. (2007). Pattern formation in the Drosophila eye. Curr. Opin. 
Genet. Dev. 17 , 309-313. 

Chan, C.C., Epstein, D., and Hiesinger, P.R. (2011). Intracellular trafficking in 
Drosophila visual system development: a basis for pattern formation through 
simple mechanisms. Dev. Neurobiol. 71 , 1227-1245. 

Charron, F., and Tessier-Lavigne, M. (2005). Novel brain wiring functions for 
classical morphogens: a role as graded positional cues in axon guidance. 
Development 132 , 2251-2262. 

Chow, R.W., Almeida, A.D., Randlett, O., Norden, C., and Harris, W.A. (2015). 
Inhibitory neuron migration and IPL formation in the developing zebrafish 
retina. Development 142 , 2665-2677. 

Clarke, P.G. (2012). The limits of brain determinacy. Proc. Biol. Sci. 279, 1665- 
1674. 

de la Pompa, J.L., and Epstein, J.A. (2012). Coordinating tissue interactions: 
Notch signaling in cardiac development and disease. Dev. Cell 22, 244-254. 
Fujishima, K., Horie, R., Mochizuki, A., and Kengaku, M. (2012). Principles of 
branch dynamics governing shape characteristics of cerebellar Purkinje cell 
dendrites. Development 139 , 3442-3455. 

Ganguly, K., and Poo, M.M. (2013). Activity-dependent neural plasticity from 
bench to bedside. Neuron 80 , 729-741 . 

He, H., Kise, Y., Izadifar, A., Urwyler, O., Ayaz, D., Parthasarthy, A., Yan, B., 
Erfurth, M.L., Dascenco, D., and Schmucker, D. (2014). Cell-intrinsic require- 
ment of Dscami isoform diversity for axon collateral formation. Science 344 , 
1182-1186. 

Hiesinger, P.R., Zhai, R.G., Zhou, Y., Koh, T.W., Mehta, S.Q., Schulze, K.L, 
Cao, Y., Verstreken, P., Clandinin, T.R., Fischbach, K.F., et al. (2006). 
Activity-independent prespecification of synaptic partners in the visual map 
of Drosophila. Curr. Biol. 16 , 1835-1843. 

Hong, W., Mosca, T.J., and Luo, L. (2012). Teneurins instruct synaptic partner 
matching in an olfactory map. Nature 484 , 201-207. 

Imai, T., Yamazaki, T., Kobayakawa, R., Kobayakawa, K., Abe, T., Suzuki, M., 
and Sakano, H. (2009). Pre-target axon sorting establishes the neural map 
topography. Science 325 , 585-590. 

Kise, Y., and Schmucker, D. (2013). Role of self-avoidance in neuronal wiring. 
Curr. Opin. Neurobiol. 23 , 983-989. 

Kolodkin, A.L., and Tessier-Lavigne, M. (2011). Mechanisms and molecules of 
neuronal wiring: a primer. Cold Spring Harb. Perspect. Biol. 3, 3. 

Kostadinov, D., and Sanes, J.R. (2015). Protocadherin-dependent dendritic 
self-avoidance regulates neural connectivity and circuit function. eLife 4 , 4. 
Langen, M., Koch, M., Yan, J., De Geest, N., Erfurth, M.L., Pfeiffer, B.D., 
Schmucker, D., Moreau, Y., and Hassan, B.A. (2013). Mutual inhibition among 
postmitotic neurons regulates robustness of brain wiring in Drosophila. eLife 2, 
e00337. 

Langen, M., Agi, E., Altschuler, D.J., Wu, L.F., Altschuler, S.J., and Hiesinger, 
P.R. (2015). The Developmental Rules of Neural Superposition in Drosophila. 
Cell 162 , 120-133. 

Lefebvre, J.L., Kostadinov, D., Chen, W.V., Maniatis, T., and Sanes, J.R. 
(2012). Protocadherins mediate dendritic self-avoidance in the mammalian 
nervous system. Nature 488 , 517-521. 

Losick, R., and Desplan, C. (2008). Stochasticity and cell fate. Science 320 , 
65-68. 



290 Cell 163 , October 8, 2015 ©2015 Elsevier Inc. 




Cell 



Matsuoka, R.L., Nguyen-Ba-Charvet, K.T., Parray, A., Badea, T.C., Chedotal, 
A., and Kolodkin, A.L. (201 1). Transmembrane semaphorin signalling controls 
laminar stratification in the mammalian retina. Nature 470 , 259-263. 

Mele, M., Ferreira, P.G., Reverter, F., DeLuca, D.S., Monlong, J., Sammeth, 
M., Young, T.R., Goldmann, J.M., Pervouchine, D.D., Sullivan, T.J., et al.; 
GTEx Consortium (2015). Human genomics. The human transcriptome across 
tissues and individuals. Science 348 , 660-665. 

Miura, S.K., Martins, A., Zhang, K.X., Graveley, B.R., and Zipursky, S.L. (2013). 
Probabilistic splicing of Dscami establishes identity at the level of single neu- 
rons. Cell 155 , 1166-1177. 

Muckli, L., Naumer, M.J., and Singer, W. (2009). Bilateral visual field maps in a 
patient with only one hemisphere. Proc. Natl. Acad. Sci. USA 106 , 13034- 
13039. 

Petrovic, M., and Hummel, T. (2008). Temporal identity in axonal target layer 
recognition. Nature 456 , 800-803. 

Petrovic, M., and Schmucker, D. (2015). Axonal wiring in neural development: 
Target-independent mechanisms help to establish precision and complexity. 
BioEssays 37 , 996-1004. 

Prusinkiewicz, P., and Lindenmayer, A. (1990). The Algorithmic Beauty of 
Plants (Springer). 

Raper, J., and Mason, C. (2010). Cellular strategies of axonal pathfinding. Cold 
Spring Harb. Perspect. Biol. 2, a001933. 

Schmucker, D., Clemens, J.C., Shu, H., Worby, C.A., Xiao, J., Muda, M., 
Dixon, J.E., and Zipursky, S.L. (2000). Drosophila Dscam is an axon guidance 
receptor exhibiting extraordinary molecular diversity. Cell 101 , 671-684. 



Schwabe, T., Neuert, H., and Clandinin, T.R. (2013). A network of cadherin- 
mediated interactions polarizes growth cones to determine targeting speci- 
ficity. Cell 154 , 351-364. 

Schwabe, T., Borycz, J.A., Meinertzhagen, I.A., and Clandinin, T.R. (2014). Dif- 
ferential adhesion determines the organization of synaptic fascicles in the 
Drosophila visual system. Curr. Biol. 24 , 1304-1313. 

Shatz, C.J. (1996). Emergence of order in visual system development. Proc. 
Natl. Acad. Sci. USA 93, 602-608. 

Srahna, M., Leyssen, M., Choi, C.M., Fradkin, L.G., Noordermeer, J.N., and 
Hassan, B.A. (2006). A signaling network for patterning of neuronal connectiv- 
ity in the Drosophila brain. PLoS Biol. 4 , e348. 

West, A.E., and Greenberg, M.E. (2011). Neuronal activity-regulated gene tran- 
scription in synapse development and cognitive function. Cold Spring Harb. 
Perspect. Biol. 3, 3. 

Williamson, W.R., Yang, T., Terman, J.R., and Hiesinger, P.R. (2010). Guidance 
receptor degradation is required for neuronal connectivity in the Drosophila 
nervous system. PLoS Biol. 8 , el 000553. 

Yogev, S. , and Shen, K. (201 4). Cellular and molecular mechanisms of synaptic 
specificity. Annu. Rev. Cell Dev. Biol. 30 , 417-437. 

Zipursky, S.L., and Sanes, J.R. (2010). Chemoaffinity revisited: dscams, proto- 
cadherins, and neural circuit assembly. Cell 143 , 343-353. 

Zschatzsch, M., Oliva, C., Langen, M., De Geest, N., Ozel, M.N., Williamson, 
W.R., Lemon, W.C., Soldano, A., Munck, S., Hiesinger, P.R., et al. (2014). 
Regulation of branching dynamics by axon-intrinsic asymmetries in Tyrosine 
Kinase Receptor signaling. eLife 3, e01699. 



Cell 163, October 8, 2015 ©2015 Elsevier Inc. 



291 




Cell 



Leading Edge 

Perspective 



Pausing on Polyribosomes: 

Make Way for Elongation in Translational Control 

Joel D. Richter^’* and Jeff Coller^ * 

■■Program in Molecular Medicine, University of Massachusetts Medical School, Worcester, MA 01605, USA 
^Center for RNA Molecular Biology, Case Western Reserve University, Cleveland, OH 44106, USA 
*Correspondence: joel.richter@umassmed.edu (J.D.R.), jmc71@case.edu (J.C.) 
http://dx.d 0 i. 0 rg/l 0.101 6/j.cell.201 5.09.041 



Among the three phases of mRNA translation — initiation, elongation, and termination — initiation 
has traditionally been considered to be rate limiting and thus the focus of regulation. Emerging ev- 
idence, however, demonstrates that control of ribosome translocation (polypeptide elongation) can 
also be regulatory and indeed exerts a profound influence on development, neurologic disease, and 
cell stress. The correspondence of mRNA codon usage and the relative abundance of their cognate 
tRNAs is equally important for mediating the rate of polypeptide elongation. Here, we discuss 
recent results showing that ribosome pausing is a widely used mechanism for controlling transla- 
tion and, as a result, biological transitions in health and disease. 



Introduction 

Since translational control became a distinct field of study, the 
term “control” for many investigators was synonymous with initi- 
ation, the first and most complicated phase of protein synthesis. 
Initiation includes formation of the 43S pre-initiation complex, its 
association with the mRNA 5' terminal 7mG cap in coordination 
with the elF4F (elF4A, elF4G, and elF4E) complex, scanning of 
the 40S ribosomal subunit to the initiation AUG codon, and 
joining of the 60S subunit to form the SOS monosome (Hinne- 
busch 2014). 

Cells generally contain a dearth of the cap-binding factor elF4E 
(Mamane et al., 2004), and its interaction with elF4G and hence its 
ability to recruit the translational apparatus is widely regulated by 
different classes of protein factors, including the elF4E binding 
proteins (4EBPs) (Richter and Sonenberg, 2005). Additionally, 
scanning of the 40S ribosomal subunit along the mRNA can be 
impeded by interacting proteins or secondary structure in the 
5'UTR. Given these distinct control points, tradition has dictated 
that initiation would be rate limiting for protein synthesis. More- 
over, it makes intuitive sense that the first step in translation 
would be the most likely to be regulated. However, emerging ev- 
idence indicates that polypeptide elongation (ribosome transit) 
can also be regulatory and indeed may be critical for controlling 
early development, neural function, and cancer etiology. Here, 
we review salient observations pointing to an important role for 
regulated ribosome translocation in diverse biological contexts. 

Translational Elongation at a Glance 

Translational initiation involves dozens of individual polypep- 
tides, and given its complexity, it is not surprising that distinct 
sub-steps can be regulated. In comparison, translational elonga- 
tion is relatively simple. In concert with elongation factor EF-1/ 
EF-TU and guanosine triphosphate (GTP), an aminoacylated 
tRNA enters the A site of the ribosome; cognate tRNA-mRNA 
codon recognition then stimulates GTP hydrolysis and eviction 
of EF-1 /EF-TU from the A site. Concomitantly, the ribosome un- 



dergoes a conformational shift, stimulating contact between the 
3' ends of the aminoacylated tRNA in the A site and the tRNA 
bearing the polypeptide chain in the P site. When the two tRNAs 
shift position (A to P and P to E site), peptide bond formation oc- 
curs as the polypeptide is now transferred to the aminoacylated 
tRNA, extending the protein by one amino acid. A second elon- 
gation factor, EF-2/EF-G, then enters the A site, hydrolyzing GTP 
and resetting the ribosome to a conformation competent to 
receive the next aminoacylated tRNA in the A site. The process 
repeats itself over and over again (Figure 1). 

Despite the “simplicity” of elongation, regulation can and does 
occur. Indeed, for decades, we have known that ribosomes stall 
after reading only the first 5-30 codons of mRNAs encoding 
secreted proteins (Siegel and Walter 1988; Halic et al., 2004). 
This activity requires the signal recognition particle (SRP), which 
binds the N terminus of the nascent polypeptide and simulta- 
neously inserts itself into the ribosome A site (Halic et al., 
2004). Docking of SRP in the ribosomal A site blocks further 
tRNA entry, arresting elongation until the ribosome/mRNA com- 
plex is localized to the endoplasmic reticulum. The lesson from 
SRP function is that A-site occlusion is a viable and potent 
means to arrest translational elongation. Thus, any factor (pro- 
tein or RNA) that can interact in or near the A site has the potential 
to stall elongation by blocking tRNA entry. 

In this regard, there are many known factors that do interact 
with the A site— all for distinct reasons. The release factors, 
eRFI and eRF3, the elongation factor EF-G, the ribosome recy- 
cling factors/mRNA decay factors DOM34 and HBS1 , and the 
mRNA decay factor SKI7 are all thought to interact at the A 
site. Thus, the A site is a busy place and a potential target for 
mRNA-specific regulation. A-site occlusion could easily be 
achieved by a message-specific regulator provided it has 
sequence-specific binding properties for its mRNA transcript 
and a motif capable of A-site docking. 

The SRP example demonstrates, in clear molecular terms, 
how elongation can be regulated. Importantly, the literature 
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contains a number of less clear but still tantalizing glimpses of 
where modulating ribosome translocation might be a driving 
force in regulation. In several of these cases, regulated elonga- 
tion is inferred from observations that repressed mRNA co-sed- 
iments with polysomes in sucrose gradients and/or that this co- 
sedimentation is resistant to puromycin treatment, which causes 
release of translating ribosomes. Nevertheless, accumulating 
evidence hints to a broad influence of translational elongation 
on the control of gene expression. 

Control of Elongation during Early Development 

Mechanisms of translational control during the early develop- 
ment of model organisms are often recapitulated in adult 
mammalian tissues. Consider, for example, the case of masked 
(i.e., repressed) mRNA in the oocytes (eggs) of sea urchins and 
frogs. As the oocytes prepare for fertilization during early 
embryogenesis, this mRNA is massively mobilized onto poly- 
somes, which coincides with a substantial decrease in ribosome 
translocation time (Brandis and Raff 1978; Richter et al., 1982). 
These and other observations of this era now seem archaic 
because, for the most part, they pre-dated one’s ability to assess 
the time required for a ribosome to transit any particular mRNA. 
Even so, they illustrate the point that polypeptide elongation 
rates can be regulated by cellular transitions. 

Masked mRNA is also a hallmark of Drosophila development. 
Here, the translation of nanos mRNA, which encodes a poste- 
rior pole determinant, is regulated both spatially and temporally. 
Although nanos mRNA translation is controlled in multiple ways. 



Figure 1. Translational Elongation at a 
Glance 

Shown are the four basic steps of translational 
elongation. The ribosome has three major tRNA 
pockets, the A, P, and E sites. The first step of 
polypeptide elongation is the recognition and ac- 
commodation of the cognate tRNA, as directed by 
a mRNA codon, within the ribosomal A site (i). The 
cognate tRNA is brought into the ribosome as a 
complex with elongation factor 1 (EF-TU in bac- 
teria) and GTP. Recognition of the cognate tRNA 
catalyzes the hydrolysis of GTP and the eviction of 
EF1 from the A site (ii). At this point, the deacylated 
tRNA in the E site is also thought to be evicted. The 
A-site tRNA and the P-site tRNA move into close 
proximity for the peptidyl transfer reaction, where 
the growing polypeptide chain is added to the 
amino acid on the A-site tRNA (ill). Elongation 
factor 2 (EF-G in bacteria) then enters the A site 
and completes ribosome translocation by moving 
the A-site tRNA to the P site and the P-site de- 
acylated tRNA into the E site (iv). The process then 
repeats itself over and over again. 



one is by ribosome stalling (Clark et al., 
2000; Andrews et al., 2011). Nanos 
mRNA co-sediments with polyribosomes 
in sucrose gradients even though no 
Nanos protein is detected; yet, when 
the polysomes were added to an in vitro 
ribosome run-off system, the stalled poly- 
somes resumed their transit and pro- 
duced Nanos protein, showing that they 
were paused rather than immobilized in an inactive form. This 
scenario is somewhat similar to that observed with oskar 
mRNA, another posterior pole determinant in Drosophila. Oskar 
mRNA also co-sediments with polysomes even though little Os- 
kar protein is detected (Braat et al., 2004). Moreover, when 
added to an in vitro translation system derived from ovaries, pu- 
romycin, an antibiotic that acts on translating ribosomes by 
mimicking tRNA and causing premature polypeptide release 
and ribosome dissociation, caused only a partial shift of the sedi- 
mentation of oskar mRNA to lighter fractions of sucrose gradi- 
ents, suggesting that it is associated with stalled ribosomes. 

Micro RNAs, which have profoundly changed our notion of 
how biological processes are regulated, were discovered during 
examination of C. elegans larval development (Lee et al., 1993; 
Wightman et al., 1993). In spite of the huge number of studies 
that have analyzed miRNA activity, the mechanism(s) by which 
they silence mRNA expression remains somewhat enigmatic, 
perhaps because they repress translation a number of different 
ways. That mRNAs silenced by miRNAs are often eventually de- 
stroyed is beyond doubt, but the step(s) at which the silencing 
occurs is seemingly manifold. Olsen and Ambros (1999) noted 
that although C. elegans lin-4 miRNA inhibited Lin-14 mRNA 
translation, the message appeared to be stable and co-sedi- 
mented with polysomes in sucrose gradients. This observation 
gave rise to the hypothesis that miRNAs repress translation by 
stalling ribosomes. Using cell lines, several labs subsequently 
found miRNA-inhibited mRNAs that also co-sedimented with 
polysomes (Nottrott et al., 2006; Maroney et al., 2006), and one 
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proposed that miRNAs promote pre-mature drop-off of trans- 
lating ribosomes (Petersen et al., 2006). Although ribosome pro- 
files derived from developing zebrafish embryos showed that at 
least miR-430 did not induce ribosome drop off (Bazzini et al., 
2012), it remains an open question as to the extent to which 
miRNAs can promote post-initiation mRNA silencing. 

Synaptic Plasticity in Neurons 

Neuronal processes, particularly dendrites, have long been 
known to harbor mRNAs whose translation is critical for synaptic 
plasticity, the underlying cellular basis of learning and memory 
(Kang and Schuman 1996; Martin et al., 1997). The regulation 
of dendritic mRNA translation must be considered in conjunction 
with cellular localization as mRNAs are transported from cell 
bodies into dendrites on molecular motors in a mostly silent 
form. The mRNAs are then activated in response to synaptic ac- 
tivity (Kanai et al., 2004). Early work found that a substantial 
portion of neuronal mRNAs reside in granules that sediment in 
sucrose gradients to fractions much heavier than polysomes. 
Electron microscopy revealed these granules to be composed 
of densely packed polysomes, but because they lack the initia- 
tion factors elF4E and elF4G, they were thought to represent 
stalled ribosomes. Membrane depolarization of neurons by 
KCI-stimulated translation partially dispersed the aggregates, 
suggesting that the stalled polysomes resumed translation (Kri- 
chevsky and Kosik 2001). 

In a contemporaneous study, Scheetz et al. (2000) stimulated 
synaptoneurosomes (a biochemical preparation of pre- and 
post-synaptic compartments) isolated from rat brain with the 
neurotransmitter N-methyl-D-aspartate (NMDA) and observed 
an increase in eEF2 phosphorylation, which would inactivate 
the enzyme and thus slow ribosome translocation. These inves- 
tigators found a concurrent increase in the synthesis of the 
critical synaptic protein alpha calcium/calmodulin protein kinase 
II (aCaMKII) and hypothesized that inhibition of elongation of 
some mRNAs allows for the elevated translation of other mRNAs 
by mechanisms involving enhanced initiation. This hypothesis 
suggests that phospho-eEF2 could discriminate among mRNAs, 
which might be accomplished by, for example, spatial segrega- 
tion of some components of the translational apparatus. 

A link between eEF2 phosphorylation and synaptic activity 
was also observed by Sutton et al. (2007), who found that 
eEF2’s enzymatic activity can be toggled by the type of neuro- 
transmission to which a neuron is subjected. eEF2 is mostly non- 
phosphorylated when neurotransmission is evoked by action 
potentials. On the other hand, miniature synaptic transmission, 
which is spontaneous in nature and independent of action poten- 
tials, results in eEF2 phosphorylation. As might be expected, 
protein synthesis is up or downregulated depending on the state 
of eEF2 phosphorylation. For this reason, Sutton et al. (2007) 
proposed that eEF2 is a sensor that links synaptic activity to local 
(i.e., dendritic) control of polypeptide elongation. 

More contemporary studies are consistent with these findings. 
Using an indirect in vivo ribosome run-off assay in neurons, 
Graber et al. (2013) found that mRNAs in dendrites are associ- 
ated with stalled polysomes that can be reactivated by stimula- 
tion of metabotropic glutamate receptors, which induces long- 
term depression (LTD), a protein synthesis-dependent form of 



synaptic plasticity. Buxbaum et al. (2014) used three-color fluo- 
rescence in situ hybridization (FISH) and single-molecule detec- 
tion technology to explore actin mRNA masking/unmasking 
dynamics in neuronal dendrites. Their results buttress the inter- 
pretation that quiescent mRNAs in neurons associate with 
stalled ribosomes and that induction of long-term potentiation 
(LTP), another form of synaptic plasticity, activates the stalled 
ribosomes to complete translation. 

Ribosome Stalling in Fragile X Syndrome 

Fragile X syndrome is the most common form of inherited intel- 
lectual disability and most frequent monogenic cause of autism. 
The syndrome is caused by a CGG repeat expansion in the 
Fmr1 gene, which causes its transcriptional inactivation. Fmr1 
encodes FMRP, an RNA binding protein that represses transla- 
tion. In the absence of FMRP, protein synthesis in the brain 
is excessive, and it is commonly thought that this leads to syn- 
aptic dysfunction and other anomalies associated with this 
disease. 

Several studies have shown that FMRP co-sediments with 
polysomes, suggesting that it inhibits translation by stalling ribo- 
somes (Feng et al., 1 997; Corbin et al., 1 997; Stefani et al., 2004). 
In a groundbreaking study by Darnell et al. (2011), FMRP was 
found to crosslink to nearly 1,000 mRNAs in the brain in an 
experiment where UV irradiation to covalently link RNAs and pro- 
teins is followed by immunoprecipitation and deep sequencing 
(this procedure is referred to as “CLIP”). Surprisingly, most of 
the sites in mRNA to which FMRP was crosslinked were distrib- 
uted in coding regions in a c/s element-independent manner. 
Moreover, extensive analysis of the associated mRNAs showed 
them to co-sediment with polysomes but to be largely resistant 
to puromycin treatment (because puromycin causes dissocia- 
tion of transiting, but not static ribosomes, one infers that ribo- 
somes do not move, or move only very slowing, on FMRP-bound 
mRNAs). By contrast, mRNAs not crosslinked to FMRP dissoci- 
ated from polysomes following puromycin treatment and sedi- 
mented with the nontranslating ribonucleoprotein (RNP) frac- 
tions of sucrose gradients. Therefore, a synthesis of these two 
observations— that FMRP binds to coding regions and that the 
ribosomes on such mRNAs do not transit— strongly implies 
that FMRP stalls ribosomes. 

In a number of cases, mouse models have shown that the 
excessive protein synthesis in the Fragile X brain can be restored 
to normal levels if a second gene is deleted, which leads rescue 
of many disease phenotypes (Dolen et al., 2007; Bhattacharya 
et al., 2012; Udagawa et al., 2013; Gross et al., 2015). For 
example, Udagawa et al. (2013) focused on CPEB1, which 
generally (although not necessarily exclusively) stimulates trans- 
lation in the brain and co-localizes and co-immunoprecipitates 
with FMRP. The binding site for CPEB1 is present in about 
30% of the FMRP-associated mRNAs and thus has the potential 
to regulate their expression. These authors proposed that trans- 
lational homeostasis in the brain might be restored if the CPEB1 
gene, as well as Fmr1 , was disrupted. Indeed, not only was the 
excessive protein synthesis restored to normal levels in FMRP/ 
CPEB double-knockout mice, but so too were a variety of path- 
ophysiologies associated with Fragile X. Moreover, Udagawa 
et al. (201 3) found that ribosome transit time in the Fragile X brain 
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was ~45% faster than in the wild-type brain, which was rescued 
to normal in the FMRP/CPEB1 double-knockout brain. How 
the loss of CPEB1 slows ribosome translocation in the FMRP- 
deficient brain is unknown, but irrespective of the mechanism 
involved, these data directly demonstrate that FMRP controls 
polypeptide elongation and that Fragile X may be a disease, at 
least in part, of accelerated ribosome translocation. 

Because the ribosome has intrinsic helicase-like activity and is 
capable of removing protein/RNA complexes as it moves along 
the mRNA (Takyar et al., 2005), FMRP acting as a simple road- 
block to stall ribosomes, while tenable as a model, seems too 
simplistic. An intriguing structural study of Drosophila FMRP 
showed that it binds directly to the ribosome via an interaction 
with ribosomal protein L5 (Chen et al., 201 4). The two KH (hnRNP 
K homology) domains of FMRP interact with the ribosome, while 
the single RGG (arginine-glycine-glycine) box could associate 
directly with RNA, suggesting that FMRP may act as a bridge be- 
tween the two to impede ribosome movement (Figure 2). How- 
ever, such a configuration does not predict whether the speci- 
ficity of FMRP binding is imparted by its association with the 
mRNA or the ribosome. 

One extant question arising from the above discussion is 
whether FMRP-mediated translational repression is permanent 
or reversible. At least one form of synaptic plasticity, LTD, in- 
duces FMRP phosphorylation and ubiquitin-mediated destruc- 
tion (Nalavadi et al., 2012; Huang et al., 2015), which one sur- 
mises would remove the block to ribosome translocation and 
allow polypeptide elongation to proceed. 

Ribosome Stalling in Neurodegeneration 

The brain contains the most complex mixture of mRNAs of any 
adult tissue, and therefore, it is not surprising that translational 



Figure 2. Three Examples of Regulated 
Polypeptide Elongation 

FMRP is proposed to bind both the ribosome and 
the engaged mRNA to impede ribosome trans- 
iocation. FMRP is not produced when the Fmr1 
gene is inactivated, which resuits in an eievated 
rate of poiypeptide eiongation and the Fragiie X 
syndrome (top), in the brain, a mutated tissue- 
specific tRNA causes ribosome staliing at its cor- 
responding codon. Neurodegeneration occurs if 
GTPBP2 is aiso mutated (middie). During proteo- 
toxic stress, the chaperone HSC70, which nor- 
maiiy binds the nascent peptide as it emerges 
from the ribosome, is titrated by misfoided pro- 
teins and causes ribosome staiiing after reading 
about 50 codons (bottom). 



control is becoming a hallmark of neural 
development and function. One stunning 
observation on translational control at 
the level of ribosome translocation 
came from a mutagenesis screen for 
neurological disorders in mice where 
one line in particular displayed profound 
brain degeneration, ataxia, and death 
by about 2 months of age (Ishimura 
et al., 2014). Mapping the mutation by 
crosses to congenic mouse strains combined with SNP analysis 
showed that the mutagen (N-ethyl-N-nitrosourea) produced a 
point mutation in a splice site of the Gtpbp2 gene, which en- 
codes a protein with homology to GTPases involved in transla- 
tion, particularly the ribosome recycling factors HBS1 and 
eRF3. These two proteins interact with the ribosome release 
factors Dom34 (yeast nomenclature) and eRFI, and GTPBP2 
does indeed co-immunoprecipitate with Pelota, the mammalian 
homolog of Dom34. However, neurodegeneration was manifest 
only in the commonly used C57BLV6J background, suggesting 
that a second modifier gene specific to this strain was neces- 
sary to produce the phenotype. The modifier gene was found 
to encode an arginine tRNA isodecoder (isodecoder tRNAs 
share the same anticodon but have changes elsewhere in the 
molecule). This arginine tRNAycu, which contains a single C 
to U mutation in the T-stem loop that is likely to result in 
RNA misfolding, is CNS specific; in contrast, GTPBP2 is widely 
expressed. Ishimura et al. (2014) hypothesized that the 
mutant tRNA'^''^ucu would cause the ribosome to stall at 
AGA codons. Indeed, ribosome profiling (Ingolia et al., 2009) re- 
vealed a particularly high number of reads at AGA codons in 
cerebellar material containing the mutant tRNA, indicating 
strong ribosome stalling at these sites. Such strong stalling 
was not evident in cerebellar tissue from animals with wild- 
type tRNA'^‘'^ucu (Ishimura et al., 2014). Thus, ribosome stalling 
at AGA codons caused by the mutant tRNA may allow mutant 
GTPBP2 to recruit Pelota and promote premature polypeptide 
release, resulting in neurodegeneration (Figure 2) (Ishimura 
et al., 2014; Darnell 2014). tRNA mis-charging (Lee et al., 
2006) or mis-folding appears to have particularly profound con- 
sequences for CNS function, suggesting that disruption of tRNA 
activity could contribute to other CNS pathologies as well. 
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Polysome Pausing by Misfolded Proteins 

Cellular stress has long been known to cause reduced transla- 
tion and promote the formation of stress granules and process- 
ing bodies (p bodies) where mRNAs have been hypothesized to 
undergo silencing and decay, respectively (Anderson and Keder- 
sha 2009; Decker and Parker 2012). Proteotoxicity can be one 
source of cellular stress and can be simulated by feeding cells 
with amino acid analogs that result in misfolded proteins. Liu 
et al. (2013) found that culturing cells in L-azetdine-2-carboxylic 
acid (AZC), a proline analog, caused rapid turnover of newly syn- 
thesized protein, and when AZC treatment was combined with 
the proteasome inhibitor MG132, protein synthesis substantially 
decreased. This treatment did not induce rapid stress granule 
formation or elF2a phosphorylation, which are common stress- 
induced events (Proud, 2005), nor did it alter other parameters 
normally associated with a block at initiation. Instead, AZC and 
MG132 caused polypeptide elongation to slow. Ribosome 
profiling showed that the stress promoted ribosome stalling 
within ~50 codons of the initiation AUG, enough to encode a 
polypeptide partially buried in the exit tunnel of the ribosome 
and partially exposed to the cellular milieu. Liu et al. (2013) hy- 
pothesized that, because the chaperone HSC70 binds and helps 
fold nascent peptides as they emerge from the ribosome, it 
would naturally facilitate ribosome transit. During proteotoxic 
stress, however, the chaperone may be titrated by misfolded 
protein and thus cause ribosome stalling and reduced translation 
(Figure 2). 

Heat shock, perhaps of the most common form of stress, has 
been known for decades to inhibit protein synthesis at least 
partly at the level of elongation (Ballinger and Pardue 1983). Us- 
ing ribosome profiling, Shalgi et al. (2013) found that heat stress 
induces widespread stalling at about codon 65. Similar to the 
data presented by Liu et al. (2013), this study found that 
HSP70 was responsible for the ribosomal stall. Although Shalgi 
et al. (2013) did not determine how the chaperone induces stall- 
ing during heat stress, they suggest that it may involve HSP70 
association with the emergent nascent peptide, that it somehow 
clogs the peptide exit tunnel, or that it mitigates elongation factor 
activity. 

An additional mode by which heat shock proteins modulate 
elongation comes out of structural analysis of yeast ribosomes 
(Zhang et al., 2014). A ribosome-associated complex (RAC), 
which is composed of HSP40 and HSP70, binds both ribosomal 
subunits through a single long alpha helix. As a consequence, a 
necessary rotation between the subunits during translation is 
limited and thus ribosome translocation is reduced. Zhang 
et al. (201 4) propose that RAC somehow responds to the folding 
needs of the nascent peptide as it emerges from the exit tunnel 
and binds the ribosome to reduce translocation such that the 
peptide can assume a productive tertiary structure. 

Caloric restriction or nutrient deprivation also induces cell 
stress. Although a number of proteins can sense that nutrients 
are in short supply, the major one is mTORCI (mTOR complex 
1). When amino acid levels are low, this kinase is inactivated, 
which leads to a number of downstream dephosphorylation 
events, including that of 4EBP1 . The non-phosphorylated form 
of this protein disrupts the ability of elF4E to bind elF4G and re- 
cruit factors necessary to initiate translation. A second nutrient 



sensor is AMP kinase (AMPK). In response to amino acid starva- 
tion, it downregulates protein synthesis and other high ATP- 
demanding processes in an effort to conserve energy. One 
way AMPK accomplishes this task is by activating eEF2 kinase 
(eEF2K), which reduces elongation by phosphorylating eEF2 
(Leprivier et al., 2013). Through this kind of energy conservation, 
AMPK-mediated eEF2 phosphorylation promotes cell survival in 
the face of nutrient restriction. 

Perhaps the most unusual but well-defined example of stress- 
regulated elongation is yeast Had mRNA, which encodes a 
transcription factor involved in the unfolded protein response 
(UPR). As the 5'end of Had mRNA emerges, it associates with 
ribosomes and begins to be translated. When the 3' UTR enters 
the cytoplasm, an intron contained within it base pairs with the 
5'UTR, thereby forming a closed loop that stalls the ribosomes. 
In response to ER stress, the 3' UTR intron is removed by the 
nuclease Irelp and the RNA ends are unconventionally spliced 
by tRNA ligase, which consequently allows the ribosomes to 
continue to catalyze polypeptide elongation (Chapman and Wal- 
ter 1997; Ruegsegger et al., 2001). 

Regulated Elongation during Oncogenic Transformation 

The AMPK-eEF2 kinase pathway mediates tumorigenesis, as 
well as caloric-restriction-induced stress. When tumor cells 
with low levels of eEF2K are starved, polypeptide elongation re- 
mains robust and energy is consumed at a high rate, thereby 
causing the cells to undergo apoptosis. However, some tumor 
cells can adapt to starvation conditions by upregulating the 
AMPK-eEF2 pathway, which inhibits polypeptide elongation, 
conserves energy, and promotes cell survival. 

mTOR is a master regulator of cell physiology and can induce 
many types of cancer through phosphorylation of 4EBP1 and 
other substrates. As noted previously, 4EBP1 inhibits initiation 
by competitively binding elF4E to the exclusion of elF4G. 
When 4EBP1 is phosphorylated, it dissociates from elF4E, which 
allows elF4E, elF4G, and elF4A to form the elF4F initiation com- 
plex. As a consequence of these events, mRNAs with particularly 
long and complex 5' UTRs are preferentially translated because 
elF4A is an RNA helicase that unwinds RNA secondary structure. 
Many mRNAs encoding oncogenes or growth-promoting factors 
have complex 5' UTRs and are upregulated by mTOR and pro- 
mote cellular transformation (Pelletier et al., 2015). mTOR can 
also modulate polypeptide elongation to facilitate cell prolifera- 
tion and cancer etiology. In intestinal cancer caused by muta- 
tions in adenomatous polyposis coli (APC), a tumor suppressor, 
mTORCI can modulate translation via phosphorylation of 
4EBP1 or S6 kinase (S6K), the latter of which is an upstream in- 
activator of eEF2K. Thus, at least for this cancer type, tumor 
growth is regulated by an axis of mTORCI -S6K-eEF2K-eEF2, 
which culminates in decreased elongation rates that are sensi- 
tive to the levels of S6K, but not 4EBP1 (Faller et al., 2015). 

Regulation by tRNAs 

In the previous examples, we have seen that mRNA-specific 
regulation might be achieved by turning translational elonga- 
tion on or off. In addition to these specialized cases, some evi- 
dence hints that the control of translation elongation might 
broadly occur and be critical for gene regulation. Unlike the 
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Figure 3. Illustration of the Concept of 
Optimality 

(A) Illustration of codon optimality showing two 
hypothetical mRNAs. The green circles represent 
optimal codons, whereas the red circles represent 
non-optimal codons. Designation of codons as 
optimal or non-optimal is a function of the con- 
centration of tRNA in the cell. The green tRNA 
concentrations are high, whereas the red tRNA 
concentrations are low; this difference impacts the 
speed of elongation. 

(B) Differentiated and proliferative cells have var- 
ied concentrations of tRNAs that are tailored to the 
decoding requirements of the expressed mRNAs. 
In this example, differentiated cells have an 
abundance of certain types of tRNAs (depicted in 
blue) because the mRNAs they express require 
similarly high levels of the corresponding codons. 
Conversely, proliferating cells contain different 
sets of tRNAs (in orange) to match the codons in 
mRNAs enriched in these cells (see Gingold et al., 
2014). 



of codons within the genome. For 
instance, commonly occurring codons 
can be classified as optimal or non- 
optimal, whereas uncommon codons 
can also be optimal or non-optimal with 
respect to their influence on translational 
elongation rate (Presnyak et al., 2015). 

Conceptually, codon optimality reflects 
the balance between the supply of 
charged tRNA molecules and their de- 
mand imposed by the concentration of codons engaged in trans- 
lation (Figure 3). Thus, the ribosome decodes some codons 
quickly because their cognate tRNAs are abundant, whereas 
other codons are read more slowly because their tRNA concen- 
trations are more limiting (Tuller et al., 201 0; Novoa and Ribas de 
Pouplana, 2012). In addition, codon optimality is based some- 
what on the accuracy of tRNA anticodon/codon interactions, 
which can influence decoding rate (Akashi, 1994; Drummond 
and Wilke, 2008). The theory that each codon is read by the ribo- 
some at subtly distinct rates would predict that the kinetics of 
protein synthesis are determined by the primary sequence of 
every gene (Presnyak et al., 2015). Thus, the overall elongation 
rate is the sum of each codon’s infinitesimally small effect on 
ribosome translocation. Recent support for this hypothesis has 
come from several labs demonstrating that codon optimality is 
a powerful determinant of both mRNA translation elongation 
and mRNA stability, which are tightly coupled events (Pechmann 
and Frydman, 2013; Presnyak et al., 2015). 

If codon content dictates translational elongation rate, then 
perhaps it can be regulated by changing the functional concen- 
tration of tRNAs. If tRNA pools change in response to stress, 
environmental conditions, or other biological cues, then so 
would the rate at which a codon is read and thus ultimately ribo- 
some translocation rates. Recent studies demonstrate that tRNA 
levels and modifications indeed fluctuate in response to biolog- 
ical cues. First, Gingold et al. (2014) demonstrated that tRNA 
pools fluctuate in over 470 tumor samples when compared to 
quiescent cells. Specifically, a subset of tRNAs is induced in 




aforementioned binary switches, ribosome translocation rates 
may subtly influence the expression of all messages. If transloca- 
tion rates are mRNA specific, then elongation would play a 
pivotal role in the synthesis, folding, and perhaps function of all 
proteins. The hypothesis that each mRNA has a distinct elonga- 
tion rate is based on the notion of supply and demand: supply 
of functional tRNAs and demand by expressed codons. In this 
light, tRNAs are implicated as critical regulators of the expressed 
transcriptome. 

In 1968, Francis Crick referred to the degeneracy of the ge- 
netic code as a “frozen accident” based on the required 64 com- 
binations needed to code for 20 amino acids (Crick, 1 968). Since 
then, a prevailing Zeitgeist has been that synonymous codon 
substitutions are silent, having no bearing on gene function. Anti- 
thetically, a growing body of literature suggests that synony- 
mous codons are differentially recognized by the translational 
apparatus. This concept has been referred to as codon opti- 
mality (Reis et al., 2004; Novoa and Ribas de Pouplana, 2012; 
Pechmann and Frydman, 2013; Krisko et al., 2014). Codon opti- 
mality should not be confused with codon usage or bias. Codon 
usage/bias is the overrepresentation of certain codons within the 
genome and is the result of numerous selective pressures, 
including translational elongation rate, translational accuracy, 
splicing, and 5' UTR structure (Akashi, 1994; Parmley et al., 
2006; Drummond and Wilke, 2008; Gu et al., 2010). The term 
codon optimality has been introduced in an attempt to specif- 
ically define the differential recognition of codons by the transla- 
tional apparatus and is a property that is distinct from the usage 
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proliferating cells and repressed in quiescent cells. Moreover, a 
distinct subset of tRNAs is repressed in proliferating cells that are 
active in quiescent cells. Importantly, the tRNAs that are induced 
during proliferation often have anticodons corresponding to co- 
dons enriched in cell-autonomous genes. In contrast, tRNAs 
induced in differentiated cells carry anticodons for codons en- 
riched in mRNAs for cell adhesion, cell-junction assembly, toll- 
like receptor signaling, and extracellular matrix genes. In a study 
in which mouse embryonic development was examined, Schmitt 
et al. (2014) demonstrated that tRNA expression patterns were 
controlled to generate an anticodon pool that corresponds to 
the codon demand by mRNAs (Schmitt et al., 2014). Thus, the 
patterns of tRNA expression match codon usage within the ex- 
pressed transcriptome (Figure 3). 

Stress can also alter the level of functional tRNAs within a cell. 
Specifically, reprogramming of tRNA modifications occurs in 
cells exposed to different conditions. Chan et al. (2012) demon- 
strated that exposure of cells to hydrogen peroxide results in an 
increase in the amount of tRNA'-®^^^'^^ containing 5-methylcyto- 
sine (m^C) at the wobble position. This increase in m^C causes 
selective translation of mRNAs enriched in the TTG codon. A 
nutrient-driven tRNA modification has also been observed in 
Drosophila. Here, the bioavailability of queuine (a modified 
base) affects the levels of queuosine-modified tRNAs (Zaborske 
et al., 2014). Queuine is scavenged by eukaryotes from the 
tRNAs of bacteria and absorbed in the gut where, at least in flies, 
it alters translation profiles. Together, these data suggest that 
tRNA pools can and do fluctuate in response to biological 
cues. If these concepts occur more broadly, then it places tRNAs 
as important and under-appreciated regulators of mRNA post- 
transcriptional regulation. 

Why Regulate at Elongation? 

The regulation of translational elongation might afford several 
advantages to an mRNA and the cell. First, loading an mRNA 
onto polyribosomes and then stalling the polysomes in 
response to the presence or absence of a stimulus would allow 
instantaneous production of new polypeptides once the stim- 
ulus was changed. This rapid response might be especially 
important in situations where, as in neurons, immediate protein 
synthesis is needed in response to synaptic stimulation. Lod- 
ging mRNA on translationally quiescent polyribosomes might 
also serve a protective function, limiting access by nucleases. 
In the cell, RNAs tend to be degraded when not associated 
with protein factors; therefore, unless there are active events re- 
organizing the mRNA out of translation and into a translation- 
repressed ribonucleoprotein complex (e.g., stress granules or 
maternal mRNA storage granules), polyribosomes might serve 
a protective role. 

Basal regulation of elongation rate by tRNAs might in fact be a 
primary driver of protein levels within the cell. It was recently 
observed that tightly coordinated optimal codon content occurs 
in genes encoding proteins with common physiological function 
(Presnyak et al., 2015). This finding suggests that there is evolu- 
tionary pressure toward certain synonymous codon usage to co- 
ordinate gene expression at the level of protein synthesis and 
mRNA decay. The coordination of protein complexes through 
coordinate codon-dependent elongation rates would provide 



an elegant mechanism to ensure a consistent stoichiometric 
relationship between all members of a given complex. Because 
this coordination is based on codon choice, changing tRNA 
levels and/or modifications would provide a simple yet sophisti- 
cated means to uniformly regulate an entire physiological pro- 
cess by changing ribosome elongation rates. 

The Road Ahead 

The advent of ribosome profiling (Ingolia et al., 2009), which dis- 
plays stalled ribosomes on an mRNA-specific and codon-spe- 
cific basis, has firmly placed elongation on the map of important 
gene regulatory mechanisms. Although, in most cases, the 
salient molecular details for stalling are just beginning to come 
into focus, perhaps some general principles may be involved. 
One is exemplified by FMRP in which a protein acts on a group 
of mRNAs to stall ribosome translocation. Certainly, intrinsic 
mRNA sequence gives specificity to the stalling, but is there 
also something intrinsic to the ribosome that allows it to be 
stalled? For example, we know the ribosome has trouble with 
lysine AAA codons (Koutmou et al., 2015) and that polyproline 
in the exit tunnel can also slow elongation (Gutierrez et al., 
2013). Thus, certain ribosome characteristics could be exploited 
by an mRNA to slow elongation, and consolidation of this event 
could occur through a second factor such as FMRP. Extrapola- 
tion of the data from several studies on FMRP suggests that this 
could be the case and that both message and ribosome 
contribute to the stalling. 

A second general principle centers on tRNA. The astonishing 
study of Ishimura et al. (2014) shows that the brain contains a 
tRNA that is absent from other tissues. When this tRNA has a 
single base change, ribosomes stall at its cognate mRNA 
codon. This stall results in CNS-specific pathology in instances 
where GTPBP2 is also mutated. Is tRNA specificity wide- 
spread, and if so, how does it influence health and disease? 
A corollary of tRNA tissue specificity is “optimality”— the 
matchup between codon prevalence and relative of abundance 
of the tRNA bearing the anticodon. If this ratio becomes 
skewed during development or times of stress, it is easy to 
see how it could alter ribosome translocation and result in a 
biological transition. 

Therapeutics aimed at the translational landscape have, for 
the most part, concentrated on initiation (e.g., Pelletier et al., 
2015). Importantly, however, just this year, a novel multiple- 
stage antimalarial agent was discovered whose target is Elonga- 
tion Factor 2 (Baragaha et al., 2015). Thus, we wonder whether 
the elongation phase of protein synthesis is an equally promising 
target for therapeutic intervention to ameliorate disease. 
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SUMMARY 

The ability to continuously adjust posture and bal- 
ance is necessary for reliable motor behavior. Vestib- 
ular and proprioceptive systems influence postural 
adjustments during movement by signaling function- 
ally complementary sensory information. Using viral 
tracing and mouse genetics, we reveal two patterns 
of synaptic specificity between brainstem vestibular 
neurons and spinal motor neurons, established 
through distinct mechanisms. First, vestibular input 
targets preferentially extensor over flexor motor 
pools, a pattern established by developmental refine- 
ment in part controlled by vestibular signaling. 
Second, vestibular input targets slow-twitch over 
fast motor neuron subtypes within extensor pools, 
while proprioceptors exhibit inversely correlated 
connectivity profiles. Genetic manipulations affecting 
the functionality of proprioceptive feedback circuits 
lead to adjustments in vestibular input to motor 
neuron subtypes counterbalancing the imposed 
changes, without changing the sparse vestibular 
input to flexor pools. Thus, two sensory signaling 
systems interact to establish complementary synap- 
tic input patterns to the final site of motor output 
processing. 

INTRODUCTION 

Descending motor control pathways are essential to regulate 
spinal circuits involved in movement (Grillner and Dubuc, 1988; 
Lundberg, 1975). Specificity of synaptic connections between 
upper motor control centers and the spinal output system pro- 
vides the anatomical substrate to implement movement variety 
and precision. As animals grow up, they engage in progressively 
more diverse and refined motor behaviors, paralleling the 
establishment of functionally mature descending input to spinal 
circuits. Despite the importance of this descending connection 
matrix, the organization of its key components and especially 
the elucidation of developmental mechanisms involved in its 
establishment are still under intense investigation. 

The ability to continuously adjust posture and balance during 
movement matures at postnatal stages in mammals (Brown, 

CrossMark 



1 981 ; Geisler et al., 1 993). Due to the importance of these adap- 
tive mechanisms for the execution of highly diverse motor 
programs, circuits steering body stabilization must exhibit a 
high degree of tuning flexibility. Two parallel and functionally 
complementary sensory signaling systems play key roles in 
this process. In the vestibular system, one central sensory organ 
in the inner ear monitors linear and rotational acceleration and 
provides input to the vestibular nucleus of the brainstem (Ange- 
laki and Cullen, 2008; Brodal and Pompeiano, 1957). Descend- 
ing vestibulo-spinal projection neurons transmit this information 
to spinal circuits to provide postural stability (Grillner et al., 1 970; 
Lund and Pompeiano, 1968; Shinoda et al., 1988; Wilson and 
Yoshida, 1968). The somatosensory system represents a com- 
plementary signaling system in which sense organs are distri- 
buted throughout the entire body (Abraira and Ginty, 2013; 
Brown, 1981; Matthews, 1981). Within this system, propriocep- 
tive sensory neurons located in dorsal root ganglia (DRG) 
monitor self-generated actions and extrinsic perturbations in 
the periphery. Of these, muscle spindle afferents report the state 
of muscle contraction from specific sites in the periphery directly 
to spinal motor neurons through monosynaptic reflex arcs 
(Brown, 1981; Eccles et al., 1957; Windhorst, 2007). 

Revealing the organization of synaptic connections to spinal 
motor neurons is crucial to understand how vestibular and pro- 
prioceptive information influences motor output. Studies in the 
adult cat provide the first evidence that vestibular neurons 
preferentially target extensor motor neuron pools (Grillner 
et al., 1970). In contrast, proprioceptors contact motor neurons 
of most pools in the spinal cord. A motor pool receives direct 
synaptic input from muscle spindle afferents supplying the 
same or synergistic muscles, but not from afferents innervating 
antagonistic muscles (Eccles et al., 1957; Mears and Frank, 
1997). Thus, both extensor and flexor motor neuron pools get 
direct proprioceptive input but in highly specific configurations, 
whereas direct vestibular input seems to be preferentially tar- 
geted to extensor motor neurons in line with its body-stabilizing 
and anti-gravitational function. Beyond their connectivity pro- 
files, vestibular and proprioceptive systems also interact func- 
tionally with each other and can contribute to both enhancement 
or depression of responses in motor neurons (Grillner et al., 
1970). 

Less is known about the mechanisms guiding developmental 
assembly of these two sensory systems. Specific connectivity 
between proprioceptors and motor neurons in the same reflex 
arc is already present at early postnatal developmental stages 
in mice (Mears and Frank, 1997) and activity-independent in 
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Figure 1 . Spatial Distribution of Vestibular 
Neurons Regulating Spinal Motor Neurons 

(A) Unilateral injections of G-protein-deleted 
Rabies viruses encoding fluorescent proteins 
(FP) into cervical and lumbar spinal cord to 
assess the position of vestibular neurons in the 
brainstem. 

(B) Representative coronal sections at the level 
of the lateral vestibular nucleus (LVe; left) and 
spinal vestibular nucleus (SpVe; right) ipsilateral to 
injection. 

(C) Three-dimensional model of the vestibular 
nucleus used for digital reconstructions, sur- 
rounded by cranial motor nuclei 5N and 10/12N. 

(D) Ipsilateral side view of digital 3D vestibular 
nucleus reconstructions derived from lumbar 
(left) and cervical (middle) spinal injections 
(colored neurons reside in, gray neurons outside 
LVe). Pie charts in upper right comers show 
percentages of LVe neurons. Right: quantifica- 
tion of lumbar and cervical projection neuron 
composition in Ve nucleus, stratified by ipsi- 
and contralateral as well as LVe and non-LVe 
residence. 

(E) Strategy for monosynaptic rabies tracing 
experiments to determine connectivity between 
vestibular and motor neurons. 

(F) Side- (left) and top-down (middle) view of 
vestibular nucleus ipsilateral to muscle injection, 
depicting the position of vestibular neurons con- 
nected to FL- (cyan) and HL- (purple) innervating 
motor neurons. Density curves for HL and FL 
premotor neuron distributions along the dorso- 
ventral axis superimposed to the ipsilateral side- 
view panel. Right: quantification of positional 
distribution as in (D). 

(G) Genetic strategy to mark LVe neurons 
by developmental origin. R4::Cre mice are 
crossed to Tau-reporter mice for conditional 

expression of nIs-LacZ and FLPo expression to assess the percentage of premotor vestibular neurons marked by R4-origin. 

(H) Most HL- or FL-premotor (Rabies^”^) neurons in LVe are marked by R4::Cre-\n6uced LacZ, whereas premotor neurons residing in SpVe do not carry this tag 
(left, middle: exemplary images; right: quantification). 
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frogs (Frank, 1990). In the vestibular system, transient develop- 
mental perturbations affect motor behavior in several species 
(Geisler and Gramsbergen, 1998; Moorman et al., 2002; Van 
Cleave and Shall, 2006; Walton et al., 2005), raising the possibil- 
ity that the assembly of the vestibular system might be plastic. 
Together, these observations provide first hints that even 
though proprioceptive and vestibular systems both functionally 
converge on motor neurons, their organization and develop- 
mental assembly mechanisms might be distinct. Moreover, 
whether and how they influence each other to establish mature 
functionality is unknown. 

In this study, we exploit intersectional viral tracing technology 
and mouse genetics to reveal that vestibulo-spinal projection 
neurons in the brainstem exhibit connection specificity to motor 
neurons. We demonstrate that they do not only target extensor 
over flexor motor neuron pools, but that within extensor pools, 
they preferentially connect to slow over fast motor neurons. 
We find that connectivity profiles arise gradually at postnatal 
developmental stages, paralleling postural maturation. Genetic 
perturbation of vestibular signaling leads to interpool connecti- 



vity defects, whereas proprioceptive feedback circuit alterations 
induce specific connectivity shifts in synaptic scaling of vestib- 
ular input to motor neuron subtypes. These findings support a 
model in which two major sensory signaling systems interact at 
the final motor output step to establish specific connectivity pro- 
files by complementary cross-modal signaling. 

RESULTS 

Spatial Organization of Spinal Projection Neurons in the 
Vestibular Nucleus 

To delineate the position of vestibular neurons with spinal projec- 
tions, we performed unilateral intraspinal injections of G-protein- 
deficient rabies viruses encoding fluorescent marker proteins 
(FP) (Rab-FP) (Wickersham et al., 2007) (Figures 1A and IB). 
We found that in a three-dimensional digital brainstem model 
(Figure 1C), vestibular neurons with lumbar projections were 
preferentially located ipsilaterally, with dominant residence 
within the lateral vestibular (LVe) nucleus and with a clear spatial 
segregation to a caudal cluster of non-LVe neurons that were 
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bilaterally distributed (Figure 1D; including spinal vestibular 
neurons [SpVe]). Vestibular neurons projecting to cervical spinal 
levels also showed clear, albeit less pronounced ipsilateral resi- 
dence within LVe, but occupied the vestibular nucleus continu- 
ously into caudal non-LVe territory (Figure 1D). In summary, for 
both lumbar and cervical vestibular projection neurons, LVe neu- 
rons exhibited a strong ipsilateral bias (Figure 1 D). These findings 
confirm and extend findings that subgroups of mouse vestibular 
neurons exhibit differential projection trajectories to interact with 
local circuits in the spinal cord (Liang et al., 2014, 2015). 

To determine the identity of vestibular neurons exerting the 
most direct influence on spinal motor neurons, we next assessed 
abundance and position of vestibular neurons with direct synap- 
tic connections to motor neurons. We used a transsynaptic 
rabies virus-based approach with monosynaptic restriction 
(Wickersham et al., 2007) (Figure IE). The majority of neurons 
with direct connections to lumbar motor neurons resided in 
the ipsilateral LVe nucleus, with the highest density peak more 
dorsal to neurons with connections to cervical motor neurons 
(Figure 1 F). 

To gain genetic access to neurons in the LVe nucleus, we 
applied a lineage tracing approach for neurons developmentally 
derived from different rhombomeric (R) origin (Figure 1G). To 
permanently mark R4-derived neurons, we used intersectional 
breeding of R4::Cre mice (Di Bonito et al., 2013) and the condi- 
tional neuronal reporter strain jquIox-stop-iox-fip-inla 

et al., 201 4). This strategy labeled the majority of lumbar-projec- 
ting LVe neurons, but the R4-marker was entirely excluded from 
non-LVe neurons (Figure 1H), which are derived from more 
caudal rhombomeres (data not shown). Together, the existence 
of the clearly delineated ipsilateral cluster of vestibular neurons 
in the LVe nucleus and the access to specific targeting ap- 
proaches allowed us to next dissect projection trajectory and 
connection specificity of these neurons to lumbar spinal circuits 
with precision. 

Lateral Vestibular Synaptic Input Is Biased to Extensor 
Motor Neurons 

To reveal the descending projection trajectory and the synaptic 
arborization pattern of LVe neurons to the lumbar spinal cord, 
we performed focal injections of adeno-associated viruses 
(AAV) into the LVe nucleus. We used AAVs expressing 
tdTomato for axonal tracing and/or a fusion protein between 
synaptophysin and GFP or Myc (Syn-Tag) for synaptic recon- 
structions (Figures S1A-S1C). We found that axons descending 
from the LVe nucleus to the lumbar spinal cord were confined 
to ipsilateral white matter tracts (Figure SID), consistent with 
previous experiments (Liang et al., 2014). Analysis of Syn-Tag 
distribution in the lumbar spinal cord revealed the highest 
density of synaptic terminals in lamina VIII ipsilateral to injection 
(Figures SID and S2). Many Syn-Tag puncta were also de- 
tected throughout the ipsilateral ventral spinal cord below 
the central canal including lamina IX containing ChAT^”^ motor 
neurons (Figures SID and S2). A similar distribution pattern 
was observed upon AAV-FRT-Syn-Tag LVe injection in R4::Cre! 
Jauiox-STOP-Iox-Fip-INLA (Figure S2). Moreover, and similar 

to findings in the rat (Du Beau et al., 2012), the majority 
of Syn-Tag^*^ terminals accumulate the vesicular glutamate 



transporter vGlut2 (74.6%; Figure S1E), demonstrating that 
LVe spinal projection neurons provide excitatory input to the 
lumbar spinal cord. 

We next assessed whether LVe input to the lumbar spinal cord 
exhibits synaptic specificity with respect to the identity of con- 
tacted motor neurons. We combined LVe AAV-Syn-Tag injec- 
tions with retrograde tracing of motor neurons from identified 
hindlimb muscles (Figures 2A and SI A). We analyzed LVe input 
to motor neurons pools innervating the ankle extensor gastroc- 
nemius (GS) and the ankle flexor tibialis anterior (TA), due to their 
functional antagonism as well as previous evidence for GS- 
biased vestibular synaptic input in the cat (Grillner et al., 1970). 
Vestibular input was also strongly biased toward the GS 
compared to the TA motor neuron pool in mice, a bias detected 
irrespective of cell body or dendritic analysis of reconstructed 
GS/TA motor neurons (Figures 2A and 2B). 

Lateral Vestibular Input Avoids GSL1 Motor Neuron 
Subtypes 

Despite this striking difference in overall input between GS and 
TA motor neurons, we noted that LVe synaptic input to individual 
GS motor neurons was highly variable. While some GS motor 
neurons received low (GS-low) LVe input, others were targeted 
by high-density (GS-high) LVe input (Figures 2A and 2B). These 
findings suggest that not all GS motor neurons are equally 
favored targets for vestibular input and raise the question of 
the underlying reason for this variability. 

Most skeletal muscles are composed of a mixture of different 
fiber types innervated by three functionally matched alpha motor 
neuron subpopulations. These motor neuron subtypes are differ- 
entially recruited during movement and include fast fatigable 
(FF), fatigue resistant (FR), and slow motor units (Burke, 1967; 
Kanning et al., 201 0). In the mouse, the most lateral subcompart- 
ment of the lateral GS muscle (GSL1 ) is a very valuable exception 
to this rule in that it is innervated exclusively by FF motor neurons 
(Pun et al., 2006). This property allowed us to assess LVe input 
specifically to FF motor neurons within the GS motor pool (Fig- 
ure 2C). We found that GSL1 FF motor neurons received only 
low-density LVe input and notably significantly less than the 
entire GS motor pool (Figures 2C and 2D). In addition, cell 
body volume values of motor neurons innervating the GSL1 
compartment have a tendency to accumulate in the upper 
two-thirds of the distribution spectrum (Figure 2E). Nonetheless, 
and consistent with previous observations (Burke et al., 1982), 
such size range classifications are not sufficient to unambigu- 
ously assign motor neuron subtype identity. In summary, GSL1 
FF motor neurons receive low-density LVe input, raising the pos- 
sibility that this input is preferentially targeted to specific motor 
neuron subtypes within extensor pools. 

LVe Input Prefers Molecularly Defined Slow Motor 
Neurons in Extensor Pools 

We next aimed to generalize our finding that LVe inputs might 
prefer slow motor neuron subtypes. Recent observations 
demonstrate that chondrolectin (ChodI) and matrix metallopro- 
tease-9 (MMP-9) are expressed by fast motor neurons (Enjin 
et al., 2010; Kaplan et al., 2014; Leroy et al., 2014). In mice ex- 
pressing the membrane marker protein placental alkaline 
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Figure 2. Vestibular Input Stratifies by 
Motor Neuron Subtype and Size 

(A) Digital reconstruction and quantification of LVe 
synaptic input density to GS and TA motor neurons 
(each dot represents one motor neuron). 

(B) Representative Neurolucida reconstructions 
of GS/TA motor neurons and LVe synaptic input 
(yellow). GS examples with high and low input 
density are shown. 

(C) Motor neuron subtype composition of GS 
and GSL1 motor pool stratified into slow (S), FR 
(fatigue resistant), and FF (fast fatigable) subtypes. 

(D) Quantification of synaptic density of LVe input 
to GS and GSL1 motor neurons (each dot repre- 
sents one motor neuron). 

(E) Synaptic density of LVe input to analyzed motor 
neurons plotted against cell body volumes. 

See also Figures SI and S2. 
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phosphatase (FLAP) from the ChodI locus (Chodl^‘~^^ (Sakurai 
et al., 2013), a large majority of ChodP'^ lumbar ChAT*^'^ motor 
neurons in the lateral motor column (LMC) coexpressed MMP- 
9 (92%), and all GSL1 FF motor neurons were PLAP°^/MMP-9°^ 
(Figures 3A-3C). We first quantified LVe synaptic input density 
to lumbar LMC motor neurons overall in C/?oc//^^^^ mice. Strati- 
fication of LVe input density by PLAP^*^ and PLAP^'^'^ status of 
targeted LMC motor neurons revealed significantly lower input 
density to PLAP^*^ than putative alpha PLAP^^*^*^ motor neurons 
(Figures 3D and 3E). Furthermore, there was a significant inverse 
correlation between LVe synaptic input density and motor 
neuron cell body volume (Figure 3E). 

To determine whether the uncovered LVe synaptic input rule 
based on Chodl/MMP-9 stratification also applies to motor 
neuron subtypes within a given extensor motor pool other than 
GS (Figures 2C-2E), we analyzed two more motor pools inner- 
vating extensor muscles. The ankle extensor muscle soleus 
(Sol) is innervated by an approximately equal number of slow 
and FR motor neurons in mice, but does not contain any FF mo- 
tor neurons (Kaplan et al., 201 4; Pun et al., 2006). Analysis of LVe 
input density to Sol motor neurons stratified by MMP-9 status re- 
vealed significantly lower values for MMP-9°^ than MMP-9^'^'^ 
Sol motor neurons (Figure 3F). Additionally, since the Sol motor 
pool does not contain any FF motor neurons (Pun et al., 2006), 
these findings indicate that slow motor neurons are not only a 
preferred LVe target over FF, but also over FR motor neurons. 
We next assessed LVe input to MMP-9 stratified motor neurons 
innervating the hip extensor gluteus (GL) and found that also for 
this pool, MMP-9°^ populations received significantly lower 



input than the MMP-9°^^ cohort (Fig- 
ure 3F). Moreover, the corresponding 
functionally antagonistic hip flexor (Ilio- 
psoas) motor pool showed LVe input 
density values similarly low as to TA flexor 
motor neurons (Figure 3F), thus general- 
izing our findings to other motor neuron 
pools. 

Together, our experiments support a 
model in which synaptic input specificity 
of LVe neurons to lumbar LMC motor neurons is organized at 
different levels (Figure 3G). First, LVe axons seek out extensor 
over flexor motor pools as preferred synaptic targets in agree- 
ment with previous work (Grillner et al., 1970). Second, LVe syn- 
aptic contacts preferentially target slow over fast motor neuron 
subtypes within an extensor pool. These findings raise the ques- 
tion of how this synaptic specificity arises during development 
and what may be factors regulating its establishment. 

Developmental Refinement of Vestibular Synaptic Input 
Specificity to Motor Neurons 

To assess synaptic input specificity of LVe neurons to lumbar 
motor neurons during development, we carried out spatially 
confined injections of AAV-Syn-Tag into the LVe nucleus early 
postnatally and retrogradely labeled GS or TA motor neurons 
(Figure 4A). The earliest time point for which it was technically 
possible to achieve consistent high-level Syn-Tag accumulation 
from LVe neurons in the lumbar spinal cord was P7. GS motor 
neurons at P7 receive synaptic input at densities similar to adult 
(Figure 4B). However, while the difference in input density be- 
tween GS and TA motor neurons was already established at 
P7, LVe terminals frequently contacted TA motor neurons at an 
overall significantly higher input density than in the adult (Fig- 
ure 4B). To assess during which time window the transition to 
mature connectivity profiles emerges, we carried out synaptic 
input mapping at progressively later developmental time points 
(Figure 4A). We found that LVe neurons still contact TA motor 
neurons at P1 1 , but that developmental refinement was com- 
plete by PI 7 (Figure 4B). 
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Figure 3. Vestibular Input Preferentially Tar- 
gets Putative Slow over Fast Motor Neurons 

(A and B) ChAT^^/NeuN^^ alpha motor neurons in 
the lumbar spinal cord of Chodl'^'-^'^ mice frac- 
tionate into Chodl^"^ and Chodl°'^'^ population, of 
which the Chodl°'^ neurons also express MMP-9. 

(C) Retrogradely marked GSL1 FF motor neurons 
express Chodl. 

(D) Density of LVe synaptic input to PLAP^”^, 
PLAP^*^*^ (all, or excluding putative gamma motor 
neurons using a size cut-off criterion of 5,000 |am^) 
motor neurons. 

(E) Synaptic density of LVe input to PLAP^”^ and 
putative alpha PLAP°'^'^ motor neurons analyzed in 
(D) plotted against cell body volumes (r = -0.548, 
p = 0.0005). 

(F) Analysis of LVe input density for gluteus 
and soleus motor neurons stratified by MMP-9 
expression status (cyan, MMP-9°'^; gray, MMP- 
9°'^'^). Input to antagonistic hip flexor muscle 
iliopsoas (IP) is also shown. 

(G) Summary diagram of synaptic specificity be- 
tween LVe and motor neurons. LVe preferentially 
targets extensor over flexor motor pools (top, in- 
terpool specificity) and within extensor pools 
preferentially slow over fast motor neuron sub- 
types (bottom, intrapool specificity, green dots 
represent synapses). 
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To determine whether LVe contacts to TA motor neurons at 
early postnatal stages represent synaptic contacts, we applied 
monosynaptic rabies viruses to muscles innervated by GS and 
TA motor neurons (Figure 4C). We found that LVe neurons con- 
nect to both GS and TA motor pools at these stages, but signif- 
icantly more LVe neurons were labeled after GS than TA muscle 
injections, at a ratio comparable to the anterograde synaptic 
density measurements at P11 (Figures 4D and 4E). Together, 
these data confirm our anterograde tracing results, demon- 
strating that initial developmental synaptic contacts to TA motor 



Perturbing Vestibular Signaling 
Affects Establishment of Interpool 
Synaptic Specificity 

To elucidate the mechanisms by which 
selectivity of vestibular input to spinal mo- 
tor neurons is established, we used two 
different genetic models in the mouse ex- 
hibiting altered vestibular neuron signaling. 
We asked how these perturbations influ- 
ence the establishment of mature connec- 
tivity profiles between vestibular neurons 
and spinal motor neurons. 

We first analyzed NADPH oxidase 3 
(Nox3) mutant mice (Figure 5A). These 
mice lack mineralized particles called 
otoconia in the inner ear’s utricle and 
saccule, leading to selective defects in perception of gravity 
and linear acceleration, but they exhibit intact semicircular canal 
vestibular as well as auditory sensory inputs (Paffenholz et al., 
2004). Of the five known vestibular input channels, predomi- 
nantly utricular or posterior semicircular canal nerve activation 
influences lumbar spinal circuits through the lateral vestibular 
tract (Uchino and Kushiro, 2011). Nox3 mutant mice therefore 
exhibit congenitally altered LVe input to the lumbar spinal cord, 
lacking information derived from the utricular sensory input 
channel but not from semicircular canals. 
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We first determined whether Nox3 mutation affects the estab- 
lishment of LVe synaptic inputs to the functionally antagonistic 
motor neuron pools GS and TA. We found that there was no dif- 
ference in LVe input density to GS motor neurons between wild- 
type and Nox3 mutant mice, but that TA motor neurons received 
LVe input at a significantly higher density in Nox3 mutant than 
wild-type mice (Figure 5B). When we compared LVe synaptic 
input density at P11 , a time point before mature connectivity pro- 
files are reached in wild-type mice, LVe input to TA motor neu- 
rons was not different between wild-type and Nox3 mutant 
mice (Figure 5B). Moreover, between P1 1 and adult stages in 
Nox3 mutant mice, no significant refinement of LVe input to TA 
motor neurons occurred (Figure 5B). Together, these findings 
demonstrate that LVe neurons maintain aberrant synaptic input 
to flexor motor neurons when otholitic vestibular signaling is 
non-functional. 

We next asked whether utricular vestibular signaling also influ- 
ences LVe connectivity profiles to fast and slow motor neuron 
subtypes. There was no significant difference in LVe synaptic 
input to FF GSL1 motor neurons between wild-type and Nox3 
mutant mice (Figure 5C). We also analyzed LVe input density 
to Sol motor neurons stratified by MMP-9 status to distinguish 
between fast (MMP-9°^) and slow (MMP-9^'^'^) motor neuron 
subtypes. While Nox3 mutant mice still exhibited clear intrapool 
differences to these motor neuron subtypes, the connectivity 
stratification was less pronounced than in wild-type mice (Fig- 
ure 5C). Together, these findings suggest that Nox3 mutants 
exhibit defects in interpool but no major intrapool LVe synaptic 
connectivity. 

We next analyzed an intersectional mouse mutant in which the 
synaptic output of most LVe neurons is functionally muted from 
the earliest developmental stages. This genetic strategy is based 
on our observations that most LVe neurons projecting to lumbar 



Figure 4. Developmental Refinement of 
Vestibular Input to Lumbar Motor Neurons 

(A) Experimental approach used and timeline. 
AAV-Syn-Tag is injected into LVe of P3 (or adult) 
mice, followed by retrograde marking of GS or TA 
motor neurons by muscular tracer injections. 

(B) Synaptic density of LVe input to GS and TA 
motor neurons at P7, P11 , P1 7, and adult stages. 
(C-E) Monosynaptic rabies tracing experiment at 
P3. Quantification of marked neurons in LVe and 
non-LVe territory (see Figure 1), both ipsi- (i) and 
contra- (c) lateral to muscle injection (normalized to 
rabies neuron number in Magnocellular nucleus). 
Side- and top-down view of ipsilateral vestibular 
reconstruction depicting GS (purple) and TA (cyan) 
vestibular neurons shown in (E). Neurons connected 
to GS or TA motor neurons were intermingled and no 
spatial segregation was discernable (E). 

(F) Summary diagram illustrating developmental 
refinement process of LVe input to GS and TA 
motor neurons. 



spinal levels are of developmental 
rhombomeric origin R4 and express the 
glutamate transporter vGlut2. In agree- 
ment, genetic elimination of vGlut2 from 
R4-derived LVe neurons (R4^''^::vGlut2^'°'' mice) abolishes vGlut2 
protein from the vast majority of spinal synapses derived from 
LVe neurons (Figures 5A and S3). 

Since R4^’'^::vGlut2^'°^ mice have not been characterized 
before, we determined whether they exhibit motor behav- 
ioral deficiencies compatible with impaired LVe function. 
R4^re::vGiut2^'°^ mice executed open field navigation, grip 
strength, and horizontal ladder tasks similar to wild-type mice 
(Figure 5D). In contrast, they exhibited defects in tasks predicted 
to profoundly engage the vestibular system. R4^''^::vGlut2^'°^ 
mice walking on a narrow beam showed significantly more slips 
than wild-type mice, and this phenotype was particularly pro- 
nounced on 6 mm- over 12 mm-wide beams (Figure 5D). These 
behavioral experiments suggest that elimination of vGlut2 from 
R4-derived LVe neurons affects vestibular function and leads to 
motor defects attributable to such perturbations. 

We next assessed synaptic input to TA motor neurons in these 
mice and found a significantly higher synaptic input density 
compared to wild-type (Figure 5B), similar to the phenotype in 
Nox3 mutant mice. Lastly, also similar to our observations in 
Nox3 mutant mice, we found no differences in LVe synaptic input 
to GSL1 and Sol motor pools in R4^’'^::vGlut2^'°^ compared to 
wild-type mice (Figure 5C). 

In summary, genetic perturbation of selective vestibular input 
channels or muting synaptic output of vestibular neurons result 
in similar connectivity defects between LVe neurons and flexor 
motor neurons (Figure 5E). Our observations also reveal that 
additional factors must play important roles in scaling vestibular 
input specificity to motor neuron subtypes. Considering the es- 
tablished roles of vestibular and proprioceptive systems in 
posture and balance, an interesting hypothesis to test is whether 
these two systems influence each other in establishing their 
respective connection specificities to motor neurons. 
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Figure 5. Perturbation of Vestibular Input 
Channel Results in Connectivity Defects to 
Motor Neurons 

(A) Cellular phenotypes and analysis of wild-type, 
Nox3~'-, and R4^''^::vGlut2^‘°^ mice. Nox3~'- mice 
exhibit defects in otolith-organ derived vestibular 
sensory input to brainstem vestibular neurons, 
whereas R4^''^::vGlut2^'°'' mice lack functional 
output from vestibular neurons to the spinal cord. 
AAV-Syn-Tag injections are performed to quantify 
synaptic input density to motor neuron sub- 
populations. 

(B) Synaptic density of LVe input to GS and 
TA motor neurons at adult (GS, TA) and P11 
(TA) stages for wild-type and Nox3 mutant 
mice. TA motor neurons were analyzed in adult 
R4^''^::vGlut2^'°^ mice. 

(C) Synaptic density of LVe input to GSL1 and Sol 
motor neurons in wild-type, Nox3 mutant, and 
R4^''^::vGlut2^'°^ mice (top row). Data for Sol motor 
neurons in wild-type and Nox3 mutant mice dis- 
played stratified by MM P-9 expression status 
(bottom row). 

(D) Behavioral analysis of wild-type and 
R4^<'^::vGlut2^'°^ mice in open field arena (tracks of 
individual mice; quantification of track length 
moved in 10 min), grip strength, horizontal ladder 
precision (hit, slip, and miss categories displayed 
in pie chart) and beam crossing on 12 mm- and 
6 mm-thick beam. 

(E) Summary diagram of synaptic input analyzed 
between LVe and motor neuron subtypes in wild- 
type mice and vestibular mutants. Note ectopic 
synaptic input to TA motor neurons in vestibular 
mutants. 

See also Figure S3. 



Proprioceptive Signaling Influences Vestibular Synaptic 
Density to Motor Neurons 

Given the striking LVe synaptic input variation to different motor 
neuron subtypes, we first determined the organization of direct 
synaptic input by proprioceptive afferents to motor neuron sub- 
types. Of proprioceptors, only muscle spindle afferents connect 
directly to motor neurons and their synaptic terminals accumu- 
late the vesicular glutamate transporter vGlutI (Oliveira et al., 
2003; Pecho-Vrieseling et al., 2009). Analogous to our analysis 
of LVe input to motor neuron subtypes (Figure 3), we quantified 
vGlutI input density to ChodP^ putative fast motor neurons, 
ChodP'^'^ putative slow motor neurons, as well as to identified 
GSL1 (exclusively FF) and Sol (many slow) motor neurons. We 
found that vGlutI input density was higher for GSL1 andChodP*^ 
motor neurons than for Sol and ChodP'^'^ putative alpha motor 
neurons (Figure 6A), a finding opposite to our analysis of input 
densities derived from the LVe nucleus (Figures 3D and 3E). 
Moreover, cell body volumes and vGlutI synaptic input density 
were positively correlated to each other (Figure 6B), further sup- 
porting the notion that fast motor neurons with relatively large 
cell bodies receive a higher density of vGlutI inputs than smaller, 
ChodP'^'^ alpha motor neurons. Analysis of both LVe and vGlutI 
input to the same cohort of motor neurons stratified by Chodl- 
expression status and cell size confirmed this conclusion (Fig- 
ures 6C and 6D). 



To determine whether the status of proprioceptive input to a 
motor neuron influences the organization of LVe input to the 
same motor neuron, we analyzed two mouse mutants with oppo- 
site proprioceptive synaptic phenotypes to motor neurons (Fig- 
ure 7A). Egr3 mutant mice exhibit early postnatal degeneration 
of muscle spindles, leading to non-functional muscle spindle af- 
ferents (Chen et al., 2002; Tourtellotte and Milbrandt, 1998). In 
contrast, Mlc::NT3 mice overexpress NT3 from skeletal muscle 
fibers, resulting in survival of superfluous proprioceptive affer- 
ents with aberrant and more synaptic connections to central syn- 
aptic partners (Wang et al., 2007). To assess LVe input to motor 
neurons in these two mutant mouse strains compared to wild- 
type mice, we quantified synaptic input to motor neuron cell 
bodies and dendrites (Figures 7 A and 7B). 

In Egr3 mutant mice, we analyzed LVe and vGlutI input 
to GSL1 motor neurons, normally exhibiting high-vGlutI and 
low-LVe input (Figures 7C-7E). In these mice, vGlutI contacts 
to GSL1 motor neurons are present (Figures 7D and 7E) despite 
their non-functionality (Chen et al., 2002; Tourtellotte and 
Milbrandt, 1998). However, LVe input to GSL1 motor neurons 
is significantly increased in Egr3 mutant mice (Figures 7D 
and 7E). Conversely, in Mlc::NT3 mice, we analyzed LVe and 
vGlutI input to Sol motor neurons that normally receive rela- 
tively low-vGlutI and high-LVe input (Figures 7F-7H). As ex- 
pected, Sol motor neurons received significantly more vGlutI 
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input in MLC::NT3 than wild-type mice, but LVe input density 
was strongly reduced (Figures 7G and 7H). Despite these 
differences in LVe connectivity to motor neuron subtypes 
however, overall LVe synaptic patterns in the spinal cord were 
not perturbed across genotypes and injection conditions 
(Figure S4). 

To determine whether the lack of direct functional propriocep- 
tive input to motor neurons in Egr3 mutant mice also influences 
LVe input to flexor motor neurons, we next compared input to 
TA motor neurons between wild-type and Egr3 mutant mice. 
We found that there was no significant difference between geno- 
types (Figures S5A-S5D). These results demonstrate that altered 
proprioceptive signaling to flexor motor neurons cannot overrule 
the scarcity of LVe input to these neurons. Thus, the assembly 
of LVe inputs at the motor pool and motor neuron subtype level 
employs distinct developmental mechanisms. 

Lastly, to test whether the synaptic scaling of these 
two complementary sensory systems operates bidirectionally, 
i.e., whether altered LVe input scales vGlutI input to motor neu- 
rons, we analyzed vGlutI input to Sol motor neurons in Nox3 
mutant mice. We detected a striking increase in vGlutI terminals 
to Sol motor neurons in these mutants compared to wild-type 
mice (Figures S6A and S6B). This finding suggests that LVe 
signaling influences the scaling of proprioceptive inputs to motor 
neurons. 

DISCUSSION 

The control of posture and balance is essential for motor perfor- 
mance. The vestibular system plays an important role in this pro- 



Figure 6. Vestibular and Proprioceptive 
Input Anti-correlated by Motor Neuron 
Subtype 

(A) Density of vGlutI synaptic input to PLAP^”^, 
PLAP°'^'^ (aii, or exciuding putative gamma motor 
neurons using a size cut-off criterion of 5,000 |am^), 
GSL1 , and soieus motor neurons. 

(B) Synaptic density of vGiutI input to PLAP^"^ and 
putative aipha PI_AP°'^'^ motor neurons anaiyzed 
in (A) piotted against ceii body voiumes (r = 0.448, 
p = 0.0054). GSL1 and Soi motor neurons are 
aiso dispiayed in this piot but not inciuded in the 
correiation anaiysis. 

(C) Piot of vGiutI versus LVe synaptic input density 
to PLAP^”^ and putative aipha PLAP^'^'^ motor 
neurons in reiation to ceii body voiume iiiustrated 
by diameter of piotted circies. 

(D) intrapooi stratification of LVe and la proprio- 
ceptive vGlutI input to fast (F, Chodl°'^) and slow 
(S, Chodl^'^'^) alpha motor neurons, revealing anti- 
correlated synaptic input densities. 



muscle 

spindle through its ability to stabilize and 

adjust body position during movement 
(Angelaki and Cullen, 2008; Grillner 
fast slow et al., 1970; Wilson and Yoshida, 1968). 

Using genetic perturbation experiments, 
we demonstrate that signaling interac- 
tions between the proprioceptive and vestibular system play a 
key role in shaping connection specificity between vestibular 
neurons in the brainstem and spinal motor neurons. We discuss 
how these findings advance our understanding of vestibular sys- 
tem function, especially in the context of connectivity refinement 
and functional interaction with proprioceptive circuitry to ensure 
smooth motor behavior. 




Motor Neuron Subtype Identity Aligns with Synaptic 
Input Specificity 

Work on the cat lumbar spinal cord demonstrates that select 
lumbar extensor motor pools are favored direct synaptic targets 
for LVe input compared to flexor counterparts (Grillner et al., 
1970), a profile we find to be conserved in mice. A key insight 
of our work is that the observed extensor-flexor interpool spec- 
ificity pattern is supplemented by a preference of LVe input to 
target slow over fast motor neuron subtypes within each 
extensor pool analyzed, and notably, this bias is even detect- 
able at the level of a general lumbar LMC motor neuron 
analysis. 

What may be the functional reasons behind the identified 
vestibulo-motor connectivity profile to preferentially target 
slow over fast motor neurons within extensor pools? Vestibular 
input enhances the activation of motor neurons innervating 
extensor muscles exhibiting antigravitational function and can 
produce large motoneuronal depolarizations through temporal 
summation (Grillner et al., 1970). This is physiologically relevant 
since vestibular neurons fire at high frequencies (Angelaki and 
Cullen, 2008), also detected in awake behaving mice (Beraneck 
and Cullen, 2007), demonstrating that the vestibular system has 
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the capability to contribute to nnotoneuronal recruitnnent. Our 
work shows that the vestibular systenn contributes to this pro- 
cess by preferential targeting of slow extensor nnotor neuron 



Figure 7. Muscle Spindle Signaling Influ- 
ences Input to Motor Neuron Subtypes 

(A) Synaptic input status of proprioceptors to 
aipha motor neurons in wiid-type, Egr3~'~ and 
MLC::NT3 mice. Egr3~^~ proprioceptive terminais 
are physicaiiy present but non-functionai, whereas 
they show over-proiiferation and aberrant con- 
nections in MLC::NT3 mice. 

(B) Summary diagram iiiustrating main findings. 
LVe inputs to motor neuron subtypes of extensor 
poois are affected by genetic manipuiation of 
proprioceptor input function. 

(C-H) Neuroiucida reconstruction (C and F) and 
quantification (D, E, G, and H) of LVe and vGiutI 
synaptic input to GSL1 and soieus motor neurons 
in wiid-type, compared to Egr3~^~ (for GSL1) and 
MLC::NT3 (for soieus) mice both at the ceii body 
(D and G) and dendrite (D and H) ievei. in (E) and 
(H) area under curves are quantified and shown in 
bar graphs. 

See aiso Figures S4, S5, and S6. 



subtypes selectively recruited during 
endurance and postural tasks and with 
the ability to support contractions 
without fatigue (Burke, 1967; Kanning 
et al., 2010). In contrast, fast motor neu- 
rons receive sparse direct vestibular 
input, in line with these motor neurons 
being recruited during fast and powerful 
muscle contractions but to fatigue 
quickly (Burke, 1967; Kanning et al., 
2010 ). 

Our work is focused on synaptic 
input specificity directly to motor neu- 
rons, but vestibular signaling also acts 
through indirect pathways via spinal 
interneurons, and these pathways also 
employ specific connectivity rules 
following motor pool-specific patterns 
(Grillner et al., 1970). Even though flexor 
motor pools do not receive direct excit- 
atory LVe inputs, disynaptic pathways 
can specifically inhibit them and thereby 
further enhance the differential func- 
tional impact that LVe signaling ex- 
hibits on extensor and flexor motor 
pools. Electrophysiological studies on 
the organization of peripheral and ru- 
brospinal inputs to motor neurons 
demonstrate that indirect inputs can 
also exhibit fiber-type-specific func- 
tional connectivity profiles (Burke et al., 
1970). Whether indirect inputs to motor 
neurons in the vestibular system also 
motor unit twitch-type organizational 



follow the intrapool 
principle as direct ones do will be an interesting question to 
pursue. 
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Multisensory Integration in the Motor Output System 

The functionality of the motor system depends heavily on contin- 
uous integration of sensory information of different modalities. 
Multisensory inputs influence many neuronal elements along 
motor output pathways, a general organizational principle that 
is evolutionarily conserved even to circuits regulating Drosophila 
larvae behavior (Ohyama et al., 2015). Focusing on the last syn- 
apse of motor output pathways affecting movement, we found 
that vestibular and proprioceptive inputs converge on slow and 
fast motor neuron subtypes with an inverse anatomical synaptic 
scaling profile. While we favor the view that functional comple- 
mentarity plays a role in synaptic scaling, whether similar scaling 
processes can also occur between functionally non-comple- 
mentary inputs remains to be determined. 

Our findings raise the question of how and where propriocep- 
tive and vestibular systems interact functionally. Most relevant 
for our study, the vestibular system can enhance proprioceptive 
inputs in a synergistic manner (Grillner et al., 1970). Moreover, 
vestibular input to motor neurons inherently carries multisensory 
information. Vestibular neurons are secondary neurons in the 
chain of sensory input processing, receiving primary vestibular 
sensory input as well as indirect feedback from the propriocep- 
tive and visual system (Angelaki and Cullen, 2008). In particular, 
somatosensory feedback circuits activated by passive hind- 
limb movement regulate vestibular neuron activity (Arshian 
et al., 2014). Thus direct vestibular input to motor neurons com- 
bines multiple sensory streams of different degrees of integration 
and we found that these inputs are organized into precise pat- 
terns and are complementary to direct proprioceptive inputs. 

Developmental Mechanisms Guiding the Assembiy of 
Inputs to Motor Neurons 

The precise developmental assembly of synaptic inputs to motor 
neurons is a prerequisite for the functionality of the mature motor 
system. Despite its importance however, mechanistic insight ex- 
ists for only a limited number of functionally defined neuronal 
subpopulations with synaptic access to motor neurons. The wir- 
ing specificity between proprioceptors and motor neuron pools 
within the same reflex arc is established early and through mech- 
anisms independent of neuronal activity (Frank, 1 990; Mears and 
Frank, 1997). Combinatorial action of neuronal and retrograde 
molecular factors as well as positional cues play important roles 
in instructing sensory-motor connectivity (Arber, 2012; Wenner 
and Frank, 1995). Yet sensory connectivity to synergistic 
motor pools refines at postnatal stages, a process influenced 
by proprioceptor neuron activity (Mendelsohn et al., 2015). 

Here, we have assessed time course and mechanisms of 
vestibular input assembly and refinement to motor neurons. 
We found that while significant input differences between 
extensor (GS) and flexor (TA) motor neurons are already estab- 
lished at early postnatal stages, a likely activity-dependent post- 
natal synaptic refinement process abolishes vestibular input to 
TA motor neurons. We revealed that this process is driven at 
least in part by vestibular signaling itself. The time window during 
which refinement occurs (P1 1 -PI 7) matches the emergence of 
posture and weight bearing in rodents (Geisler et al., 1993), 
raising the possibility that maturation of synaptic input may be 
linked to the emergence of postural behavioral abilities. 



The second level of synaptic input scaling to motor neuron 
subtypes is shaped by bidirectional sensory signaling. Genetic 
manipulations affecting either the functionality of muscle spindle 
feedback or vestibular signaling resulted in adjustments of 
the other channel counterbalancing the genetically imposed 
changes. How could such input adjustment to motor neurons 
be regulated? We found that in mice, proprioceptive connections 
exhibit higher proximal synaptic input with gradually decreasing 
input on distal dendrites, in agreement with recent input recon- 
structions to rat motor neurons (Rotterman et al., 201 4). Interest- 
ingly, compensatory LVe input distribution to GSL1 FF motor 
neurons in Egr3 mutant mice scales accordingly. Since these 
muscle spindle afferent synapses are present but non-functional 
(Chen et al., 2002), it is likely that the observed adjustment of 
synaptic input to motor neurons is not merely a competition for 
synaptic space. A plausible mechanism instead might be that 
synaptic input to motor neurons is regulated locally through 
retrograde and homeostatic mechanisms involving postsynaptic 
feedback from motor neurons. In this context, it is interesting to 
consider that individual group la afferents connect to almost all 
motor neurons supplying the same muscle (Mendell and Henne- 
man, 1968). la input density scaling therefore likely occurs at the 
level of individual motor neurons according to subtype identity. 
Moreover, proprioceptor-driven vestibular synaptic scaling 
only operates on motor neuron pools to which LVe input has 
direct functional impact, as we observed no input scaling to TA 
motor neurons that receive proprioceptive but are devoid of 
LVe input. Thus, the two studied sensory channels differentially 
influence the refinement and scaling process of inputs to motor 
neurons, further supporting the idea that multiple independent 
layers regulate input specificity to motor neuron subtypes. 

Bidirectional synaptic compensation may also explain at least 
part of the relatively minor locomotor phenotypes observed in 
Egr3 mutants (Takeoka et al., 2014) and R4°''^::vGlut2^'°’‘ mice 
analyzed here, both of which exhibit signaling defects in the 
respective sensory system starting during development. Inter- 
estingly, cross-modal sensory regulation during development 
also appears to operate in humans. Patients with infant-onset 
vestibular system dysfunction show limited behavioral abnor- 
malities likely due to somatosensory compensatory mechanism, 
whereas compensation following adult injury to the vestibular 
system is restricted (Horak et al., 1994). These observations 
suggest that there might be a developmentally defined critical 
period for cross-modal sensory regulation to adjust circuitry to 
motor neurons needed for posture and balance. Together, our 
work uncovers how sensory inputs of functionally complemen- 
tary modality converge and influence each other at the final 
output step controlling movement, providing an important 
contribution to understanding specificity and function of the 
motor system. 

EXPERIMENTAL PROCEDURES 
Mouse Genetics 

Jauiox-STOP-Iox-Fip-INLA (pj^etta et al., 2014), Chodf'-^’" (Sakurai et al., 2013), 
NADPH oxidase 3 {Nox3) mutant (Paffenholz et al., 2004), vGlut2^'°^ (Jax 
Mice Strain #007583), Egr3 mutant (Tourtellotte and Milbrandt, 1998), R4^''^ 
(Di Bonito et al., 2013), and Mlc::NT3 (Wang et al., 2007) mouse strains were 
maintained on a mixed genetic background (129/C57BI6). Housing, surgery. 
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behavioral experiments and euthanasia were performed in compliance with 
the Swiss Veterinary Law guidelines. 

Virus Production and Injections 

Rabies viruses (Rabies-mCherry and Rabies-GFP: Rabies-FP) used were 
amplified and purified from local viral stocks following established protocols 
(Stepien et al., 2010; Wickersham et al., 2007). All AAVs used in this study 
were described previously (Esposito et al., 2014; Pivetta et al., 2014; Takeoka 
et al., 2014) and of genomic titers >1 x 10e13. Additional information on 
anterograde and retrograde viral tracing, immunohistochemistry, imaging 
and anatomical quantification are found in the Supplemental Experimental 
Procedures. 

Behavioral Analysis 

Open field, grip strength test, horizontal ladder locomotion, and beam tests 
were performed as previously described (Esposito et al., 2014; Takeoka 
et al., 2014; Carter et al., 2001). Additional information on detailed behavioral 
analyses is found in the Supplemental Experimental Procedures. 

Statistics 

All statistical analysis, plots, and linear regression lines were made using 
GraphPad PRISM (v6.0). Column bar graphs and dot plots represent the 
average value ± SEM. The means of different data distributions were 
compared using an unpaired Student’s t test (Figures ID, IF, 1H, 2A, 2D, 
3D, 3F, 4B, 4D, 5B-5D, 6A, 7D, 7G, S3, S4, S5B, and S6A). Correlation analysis 
was used for Figures 3E and 6B. A one-way ANCVA for independent measure- 
ments was used for comparing multiple TA datasets in Figure 4B. The area 
under the frequency-distribution curves in Figures 71, 7H, S5, and SOB was 
used as a measure of synaptic input on dendrites. The correlogram plot shown 
in Figure 6C was obtained in R using the library ggplot. Significance level is 
defined as follows for all analyses performed: *p < 0.05; **p < 0.01 ; ***p < 0.001 . 

SUPPLEMENTAL INFORMATION 

Supplemental Information includes Supplemental Experimental Procedures 
and six figures and can be found with this article online at http://dx.doi.org/ 
10.1016/j.cell.2015.09.023. 

ACKNOWLEDGMENTS 

We are grateful to M. Mielich for expert technical help, M. Tripod! for help and 
guidance during initial stages of the project, F. Roselli for insights into motor 
neuron subtypes and muscle subcompartments, M. Studer for sharing 
R4::Cre mice with us, S. Bourke, C. Genoud, and L. Gelman from the FMI im- 
aging facility, N. Ehrenfurchter from the Biozentrum Imaging facility and M. 
Stadler from the FMI Bioinformatics Platform for help and advice with image 
acquisition and analysis, and to P. Caroni and B. Roska for discussions and 
comments on the manuscript. E.B. was supported by a fellowship of the 
Werner Siemens Foundation, A.T. by an International Foundation for Research 
in Paraplegia (IRP) fellowship. All authors were supported by an ERG 
Advanced Grant, the Swiss National Science Foundation, the Kanton Basel- 
Stadt, and the Novartis Research Foundation. 

Received: July 8, 2015 
Revised: August 22, 2015 
Accepted: September 1, 2015 
Published: October 8, 2015 

REFERENCES 

Abraira, V.E., andGinty, D.D. (201 3). The sensory neurons of touch. Neuron 79, 
618-639. 

Angelaki, D.E., and Cullen, K.E. (2008). Vestibular system: the many facets of a 
multimodal sense. Annu. Rev. Neurosci. 31, 125-150. 

Arber, S. (2012). Motor circuits in action: specification, connectivity, and func- 
tion. Neuron 74, 975-989. 



Arshian, M.S., Hobson, C.E., Catanzaro, M.F., Miller, D.J., Puterbaugh, S.R., 
Cotter, L.A., Yates, B.J., and McCall, A.A. (2014). Vestibular nucleus neurons 
respond to hindlimb movement in the decerebrate cat. J. Neurophysiol. Ill, 
2423-2432. 

Beraneck, M., and Cullen, K.E. (2007). Activity of vestibular nuclei neurons dur- 
ing vestibular and optokinetic stimulation in the alert mouse. J. Neurophysiol. 
98, 1549-1565. 

Brodal, A., and Pompeiano, O. (1957). The vestibular nuclei in cat. J. Anat. 91 , 
438-454. 

Brown, A.G. (1981). Organization of the Spinal Cord (Springer Verlag). 

Burke, R.E. (1967). Motor unit types of cat triceps surae muscle. J. Physiol. 
193, 141-160. 

Burke, R.E., Jankowska, E., and ten Bruggencate, G. (1970). A comparison of 
peripheral and rubrospinal synaptic input to slow and fast twitch motor units of 
triceps surae. J. Physiol. 207, 709-732. 

Burke, R.E., Dum, R.P., Fleshman, J.W., Glenn, L.L., Lev-Tov, A., O’Donovan, 
M.J., and Pinter, M.J. (1982). A HRP study of the relation between cell size and 
motor unit type in cat ankle extensor motoneurons. J. Comp. Neurol. 209, 
17-28. 

Carter, R.J., Morton, J., and Dunnett, S.B. (2001). Motor coordination and bal- 
ance in rodents. Curr. Protoc. Neurosci. Chapters, Unit 8.12. 

Chen, H.H., Tourtellotte, W.G., and Frank, E. (2002). Muscle spindle-derived 
neurotrophin 3 regulates synaptic connectivity between muscle sensory and 
motor neurons. J. Neurosci. 22, 3512-3519. 

Di Bonito, M., Narita, Y., Avallone, B., Sequino, L., Mancuso, M., Andolfi, G., 
Franze, A.M., Puelles, L, Rijii, F.M., and Studer, M. (2013). Assembly of the 
auditory circuitry by a Hox genetic network in the mouse brainstem. PLoS 
Genet. 9, el 003249. 

Du Beau, A., Shakya Shrestha, S., Bannatyne, B.A., Jalicy, S.M., Linnen, S., 
and Maxwell, D.J. (2012). Neurotransmitter phenotypes of descending sys- 
tems in the rat lumbar spinal cord. Neuroscience 227, 67-79. 

Eccles, J.C., Eccles, R.M., and Lundberg, A. (1957). The convergence of 
monosynaptic excitatory afferents on to many different species of alpha moto- 
neurones. J. Physiol. 137, 22-50. 

Enjin, A., Rabe, N., Nakanishi, S.T., Vallstedt, A., Gezelius, H., Memic, F., Lind, 
M., Hjalt, T., Tourtellotte, W.G., Bruder, C., et al. (2010). Identification of novel 
spinal cholinergic genetic subtypes disclose ChodI and Pitx2 as markers for 
fast motor neurons and partition cells. J. Comp. Neurol. 518, 2284-2304. 

Esposito, M.S., Capelli, P., and Arber, S. (2014). Brainstem nucleus MdV 
mediates skilled forelimb motor tasks. Nature 508, 351-356. 

Frank, E. (1990). The formation of specific synaptic connections between 
muscle sensory and motor neurons in the absence of coordinated patterns 
of muscle activity. J. Neurosci. 10, 2250-2260. 

Geisler, H.C., and Gramsbergen, A. (1998). Motor development after vestibular 
deprivation in rats. Neurosci. Biobehav. Rev. 22, 565-569. 

Geisler, H.C., Westerga, J., and Gramsbergen, A. (1993). Development of 
posture in the rat. Acta Neurobiol. Exp. (Warsz.) 53, 517-523. 

Grillner, S., and Dubuc, R. (1988). Control of locomotion in vertebrates: spinal 
and supraspinal mechanisms. Adv. Neurol. 47, 425-453. 

Grillner, S., Hongo, T., and Lund, S. (1970). The vestibulospinal tract. Effects 
on alpha-motoneurones in the lumbosacral spinal cord in the cat. Exp. Brain 
Res. 10, 94-120. 

Horak, F.B., Shupert, C.L, Dietz, V., and Horstmann, G. (1994). Vestibular and 
somatosensory contributions to responses to head and body displacements in 
stance. Exp. Brain Res. 100, 93-106. 

Kanning, K.C., Kaplan, A., and Henderson, C.E. (2010). Motor neuron diversity 
in development and disease. Annu. Rev. Neurosci. 33, 409-440. 

Kaplan, A., Spiller, K.J., Towne, C., Kanning, K.C., Choe, G.T., Geber, A., Akay, 
T., Aebischer, P., and Henderson, C.E. (2014). Neuronal matrix metalloprotei- 
nase-9 is a determinant of selective neurodegeneration. Neuron 81, 333-348. 



Cell 163, 301-312, October 8, 2015 ©2015 Elsevier Inc. 311 




Cell 



Leroy, F., Lamotted’ Incamps, B., Imhoff-Manuel, R.D., and Zytnicki, D. (2014). 
Early intrinsic hyperexcitability does not contribute to motoneuron degenera- 
tion in amyotrophic lateral sclerosis. eLife 3, 3. 

Liang, H., Bacskai, T., Watson, C., and Paxinos, G. (201 4). Projections from the 
lateral vestibular nucleus to the spinal cord in the mouse. Brain Struct. Funct. 
219 , 805-815. 

Liang, H., Bacskai, T., and Paxinos, G. (2015). Termination of vestibulospinal 
fibers arising from the spinal vestibular nucleus in the mouse spinal cord. 
Neuroscience 294 , 206-214. 

Lund, S., and Pompeiano, O. (1968). Monosynaptic excitation of alpha moto- 
neurones from supraspinal structures in the cat. Acta Physiol. Scand. 73 , 1 -21 . 
Lundberg, A. (1 975). Control of spinal mechanisms from the brain. In The Basic 
Neurosciences (Raven Press). 

Matthews, P.B. (1981). Evolving views on the internal operation and functional 
role of the muscle spindle. J. Physiol. 320 , 1-30. 

Mears, S.C., and Frank, E. (1997). Formation of specific monosynaptic con- 
nections between muscle spindle afferents and motoneurons in the mouse. 
J. Neurosci. 17 , 3128-3135. 

Mendell, L.M., and Henneman, E. (1968). Terminals of single la fibers: distribu- 
tion within a pool of 300 homonymous motor neurons. Science 160 , 96-98. 
Mendelsohn, A.I., Simon, C.M., Abbott, L.F., Mentis, G.Z., and Jessell, T.M. 
(2015). Activity regulates the incidence of heteronymous sensory-motor con- 
nections. Neuron 87 , 111-123. 

Moorman, S.J., Cordova, R., and Davies, S.A. (2002). A critical period for func- 
tional vestibular development in zebrafish. Dev. Dyn. 223 , 285-291 . 

Ohyama, T., Schneider-Mizell, C.M., Fetter, R.D., Aleman, J.V., Franconville, 
R., Rivera-Alba, M., Mensh, B.D., Branson, K.M., Simpson, J.H., Truman, 
J.W., et al. (2015). A multilevel multimodal circuit enhances action selection 
in Drosophila. Nature 520 , 633-639. 

Oliveira, A.L., Hydiing, F., Olsson, E., Shi, T., Edwards, R.H., Fujiyama, F., 
Kaneko, T., Hokfelt, T., Cullheim, S., and Meister, B. (2003). Cellular localiza- 
tion of three vesicular glutamate transporter mRNAs and proteins in rat spinal 
cord and dorsal root ganglia. Synapse 50 , 117-129. 

Paffenholz, R., Bergstrom, R.A., Pasutto, F., Wabnitz, P., Munroe, R.J., Jagla, 
W., Heinzmann, U., Marquardt, A., Bareiss, A., Laufs, J., et al. (2004). Vestib- 
ular defects in head-tilt mice result from mutations in Nox3, encoding an 
NADPH oxidase. Genes Dev. 18 , 486-491 . 

Pecho-Vrieseling, E., Sigrist, M., Yoshida, Y., Jessell, T.M., and Arber, S. 
(2009). Specificity of sensory-motor connections encoded by Sema3e-Plxnd1 
recognition. Nature 459 , 842-846. 

Pivetta, C., Esposito, M.S., Sigrist, M., and Arber, S. (2014). Motor-circuit 
communication matrix from spinal cord to brainstem neurons revealed by 
developmental origin. Cell 156 , 537-548. 



Pun, S., Santos, A.F., Saxena, S., Xu, L, and Caroni, P. (2006). Selective 
vulnerability and pruning of phasic motoneuron axons in motoneuron disease 
alleviated by CNTF. Nat. Neurosci. 9 , 408-419. 

Rotterman, T.M., Nardelli, P., Cope, T.C., and Alvarez, F.J. (2014). Normal dis- 
tribution of VGLUT1 synapses on spinal motoneuron dendrites and their reor- 
ganization after nerve injury. J. Neurosci. 34 , 3475-3492. 

Sakurai, K., Akiyama, M., Cai, B., Scott, A., Han, B.X., Takatoh, J., Sigrist, M., 
Arber, S., and Wang, F. (201 3). The organization of submodality-specific touch 
afferent inputs in the vibrissa column. Cell Rep. 5, 87-98. 

Shinoda, Y., Ohgaki, T., Futami, T., and Sugiuchi, Y. (1988). Vestibular projec- 
tions to the spinal cord: the morphology of single vestibulospinal axons. Prog. 
Brain Res. 76 , 17-27. 

Stepien, A.E., Tripodi, M., and Arber, S. (2010). Monosynaptic rabies virus re- 
veals premotor network organization and synaptic specificity of cholinergic 
partition cells. Neuron 68 , 456-472. 

Takeoka, A., Vollenweider, I., Courtine, G., and Arber, S. (2014). Muscle spin- 
dle feedback directs locomotor recovery and circuit reorganization after spinal 
cord injury. Cell 159 , 1626-1639. 

Tourtellotte, W.G., and Milbrandt, J. (1998). Sensory ataxia and muscle spindle 
agenesis in mice lacking the transcription factor Egr3. Nat. Genet. 20 , 87-91. 
Uchino, Y., and Kushiro, K. (2011). Differences between otolith- and semicir- 
cular canal-activated neural circuitry in the vestibular system. Neurosci. Res. 
71 , 315-327. 

Van Cleave, S., and Shall, M.S. (2006). A critical period for the impact of vestib- 
ular sensation on ferret motor development. J. Vestib. Res. 16 , 179-186. 
Walton, K.D., Harding, S., Anschel, D., Harris, Y.T., and Llinas, R. (2005). 
The effects of microgravity on the development of surface righting in rats. 
J. Physiol. 565 , 593-608. 

Wang, Z., Li, L.Y., Taylor, M.D., Wright, D.E., and Frank, E. (2007). Prenatal 
exposure to elevated NT3 disrupts synaptic selectivity in the spinal cord. 
J. Neurosci. 27, 3686-3694. 

Wenner, P., and Frank, E. (1995). Peripheral target specification of synaptic 
connectivity of muscle spindle sensory neurons with spinal motoneurons. 

J. Neurosci. 15 , 8191-8198. 

Wickersham, I.R., Lyon, D.C., Barnard, R.J., Mori, T., Finke, S., Conzelmann, 

K. K., Young, J.A., and Callaway, E.M. (2007). Monosynaptic restriction of 
transsynaptic tracing from single, genetically targeted neurons. Neuron 53 , 
639-647. 

Wilson, V.J., and Yoshida, M. (1968). Vestibulospinal and reticulospinal effects 
on hindlimb, forelimb, and neck alpha motoneurons of the cat. Proc. Natl. 
Acad. Sci. USA 60 , 836-840. 

Windhorst, U. (2007). Muscle proprioceptive feedback and spinal networks. 
Brain Res. Bull. 73, 155-202. 



312 Cell 163, 301-312, October 8, 2015 ©2015 Elsevier Inc. 




Article 



Cell 



A Molecular Code for Identity in the Vomeronasai 
System 



Graphical Abstract 




Authors 

Xiaoyan Fu, Yuetian Yan, Pei S. Xu 

Wongi Chong, Michael L. Gross, 

Timothy E. Hoiy 

Correspondence 

holy@wustl.edu 

In Brief 

Component-activity matching (CAM), a 
technique for seiecting candidate ligands 
that “expiain” patterns of bioactivity 
across diverse compiex mixtures, 
enabled the discovery of several new 
ligands that explain firing rates in 
functionaiiy distinct classes of 
vomeronasal neurons, including a new 
set of ligands that are female sex 
pheromones for the mouse. 



Highlights 

• CAM is an approach to match candidate iigands and specific 
patterns of bioactivity 

• 23 compounds in mouse urine explain firing rates in 7 ciasses 
of vomeronasai neurons 

• Steroid carboxyiic acids are a new family of vomeronasal 
ligands 

• Steroid carboxylic acids are femaie sex pheromones for the 
mouse 



Fuetal., 2015, Cell 763, 1-11 
October 8, 201 5 ©201 5 Elsevier Inc. 
http://dx.d 0 i. 0 rg/l 0.1 01 6/j.cell.201 5.09.01 2 



CelPress 




Please cite this article in press as: Fu et al., A Molecular Code for Identity in the Vomeronasal System, Cell (2015), http://dx.doi.org/10.1016/ 
j.cell.2015.09.012 



Article 



Cell 



A Molecular Code for Identity 
in the Vomeronasal System 

Xiaoyan Fu,^ Yuetian Yan,^ Pei S. Xu,^ Man Geerlof-Vidavsky,^ Wongi Chong, ^ Michael L. Gross, ^ and Timothy E. Holy^ * 

■'Department of Anatomy and Neurobiology, Washington University in St. Louis, St. Louis, MO 63110, USA 
^Department of Chemistry, Washington University in St. Louis, St. Louis, MO 63110, USA 
*Correspondence: holy@wustl.edu 
http://dx.doi.Org/1 0.1 01 6/j.cell.201 5.09.01 2 



SUMMARY 

In social interactions among mammals, individuals 
are recognized by olfactory cues, but identifying 
the key signals among thousands of compounds re- 
mains a major challenge. To address this need, we 
developed a new technique, component-activity 
matching (CAM), to select candidate ligands that 
“explain” patterns of bioactivity across diverse com- 
plex mixtures. Using mouse urine from eight different 
sexes and strains, we identified 23 components to 
explain firing rates in seven of eight functional clas- 
ses of vomeronasal sensory neurons. Focusing on 
a class of neurons selective for females, we identified 
a novel family of vomeronasal ligands, steroid car- 
boxylic acids. These ligands accounted for much of 
the neuronal activity of urine from some female 
strains, were necessary for normal levels of male 
investigatory behavior of female scents, and were 
sufficient to trigger mounting behavior. CAM repre- 
sents the first step toward an exhaustive character- 
ization of the molecular cues for natural behavior in 
a mammalian olfactory system. 

INTRODUCTION 

Mammals explore the chemical world with several olfactory 
modalities (Ma, 2007). The accessory olfactory system (AOS) 
emphasizes the detection of social cues, sometimes called 
pheromones, that regulate behavior among members of the 
same species (Dulac and Torello, 2003; Halpern and Marti- 
nez-Marcos, 2003). One of the great attractions of the AOS is 
the opportunity to explore the molecular, cellular, and circuit 
underpinnings of behavior in a genetically tractable mammal. 
Perhaps the most substantial current barrier to exploit this 
promise is our incomplete understanding of the natural cues 
that the AOS detects. Knowing more about the nature of the 
stimuli will enable sophisticated studies of sensory coding, cir- 
cuit function, and behavior. 

Several scent sources are known to be involved in chemical 
communication in mammals: secretory glands (lacrimal, salivary, 
and preputial), urine, and feces (Halpern and Martinez-Marcos, 
2003; Kimoto et al., 2005). Mouse urine excites widespread ac- 
tivity among vomeronasal sensory neurons (VSNs) (He et al.. 



2008; Holy et al., 2000; Stowers et al., 2002; Tolokh et al., 
201 3) and is the best behaviorally characterized source of chem- 
ical cues for mammalian social communication (Halpern and 
Martinez-Marcos, 2003). Mouse urine conveys information 
about the sex and strain (Brennan and Keverne, 1997; Kimchi 
et al., 2007; Leypold et al., 2002; Pankevich et al., 2004; Stowers 
et al., 2002). Although progress has been made recently toward 
identifying the molecular nature of pheromone cues by purifying 
individual ligands from mouse urine (Chamero et al., 2007; Hsu 
et al., 2008; Nodari et al., 2008), the identities of olfactory ligands 
inside urine cues are largely unknown; there is not even an esti- 
mate of how many distinct compounds comprise the “olfactory 
identity” of an individual. 

Here, we developed an approach, component-activity match- 
ing (CAM) (Figure 1), with the aim of systematically and exhaus- 
tively defining a “short list” of candidate vomeronasal ligands 
that encode identity. This approach exploits the striking differ- 
ences among natural stimuli across different sexes and strains 
of mice (Tolokh et al., 201 3), which we use as a proxy for the vari- 
ability that one might find at the individual level in natural, genet- 
ically diverse, populations of mice. With CAM, we performed a 
forward screen for compounds that may drive activity in 
vomeronasal neurons, without the need for laborious sample pu- 
rification. Using a combination of physiological recording and 
quantitative liquid chromatography-tandem mass spectrometry 
(LC-MS/MS) (Figure 1), we identified a small set of constituents 
whose concentrations match a pattern of neuronal responsive- 
ness across samples. 

Using this approach, we focused on a class of neurons that 
responded to urine from females of all strains, but not to any 
male strains. We purified and structurally identified the two 
best CAM candidates to explain this pattern of neuronal activ- 
ity. These two candidates are carboxylic acid steroid metabo- 
lites, which we call “cortigynic acid” and “corticosteronic 
acid.” Cortigynic acid accounts for one-fourth of neuronal ac- 
tivity of female C57BL/6J mouse urine. Cortigynic acid was 7- 
fold more abundant in the urine of gonadally intact adult 
female mice than in juvenile mice, was 30-fold higher than in 
an ovariectomized female, and was not detectable in male 
mice. When added to the urine of male mice, cortigynic acid 
induced additional exploratory behavior as observed previ- 
ously with whole female urine (Pankevich et al., 2004; Holy 
and Guo, 2005; Guo and Holy, 2007). Removing cortigynic 
acid from the female urine extract significantly decreased 
male’s investigation time toward the “deficient” female urine 
extract. Finally, we found that the combination of cortigynic 
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Figure 1 . CAM for Ligand Identification 

A diverse collection of samples (here, urine extracts from male and female 
mice of different strains) are analyzed to extract the relative concentrations of 
each component by LC-ESI mass spectrometry and subjected to an assay of 
activity (left, by recordings of spiking responses). For a selected neuron, 
physiological responses (neuronal firing rate, orange bars) to the different 
stimuli are compared to the abundances of individual components (blue and 
green bars). Component 1 (blue bars) is not distributed in a manner that could 
explain the firing rate responses and is therefore an implausible candidate. In 
contrast, component 2 (green bars) has an across-sample distribution 
consistent with the measured neuronal firing rates and is thus a plausible 
candidate to explain the response. 

acid and corticosteronic acid is sufficient to promote male- 
mounting behavior. 

This manuscript describes the first steps in a comprehensive 
analysis of the molecular cues for identity in a mammalian olfac- 
tory system. We applied this approach to a long-standing mys- 
tery (Dixon and Mackintosh, 1975) and discovered the chemical 
identity of female sex pheromones for the mouse. 

RESULTS 

A New Strategy for Identifying Vomeronasal Ligands 
from Complex Mixtures 

In urine, much of the activity that drives firing in VSNs originates 
from multiple compounds with similar physico-chemical proper- 
ties (Nodari et al., 2008). This similarity poses a major challenge 
to traditional methods for identifying the source of neuronal ac- 
tivity from complex mixtures. To overcome the difficulties, we 
developed a new strategy based on the observation that urine 
from different sexes and strains excites differing patterns of ac- 
tivity in vomeronasal neurons (Tolokh et al., 2013). We hypothe- 
sized that these different neuronal patterns could be caused by 
different ligands present in one or more of urine samples. We 
reasoned that the molecular correlates of identity might, there- 
fore, be discoverable by seeking compounds whose abundance 
could “explain” these patterns of physiological responses. By 
picking out compounds whose concentrations across samples 
matched the pattern of neuronal firing (activity), we were able 
to identify plausible VSN candidate ligands without any prelimi- 
nary purification (Figure 1). 



Diverse Vomeronasal Responses to Urine Sampies from 
Different Sexes and Strains 

Inspired by the genetic differences among individuals present in 
natural populations, we selected mice of four laboratory strains 
(BALB/c, C57BL/6J, CBA, and DBA; Figure 1) and tested their 
C18 urine extracts for their ability to drive spiking responses in 
mouse vomeronasal neurons. We used a multi-electrode array 
(Nodari et al., 2008; Arnson et al., 2010; Holy et al., 2000) to re- 
cord spiking responses of large numbers of VSNs simulta- 
neously from 13 intact vomeronasal neuroepithelia isolated 
from B6D2F1 male mice (Figure 2A). To reduce potential con- 
founds from saturation, each of these eight urine extracts was 
tested at three different dilutions: 1:100, 1:300, and 1 :1 ,000. A to- 
tal of 488 single neurons were isolated from these spike record- 
ings. To test the reproducibility of the responses, each stimulus 
was presented once in each of five cycles, each cycle delivering 
stimuli and negative controls alternating with flush solution in 
pseudo-random order. This stimulation paradigm resulted in a 
very large number— a total of 130— stimulus trials for each 
neuron in each preparation (Figure 2A). Responses were quanti- 
fied in terms of the mean change in firing rate, Ar, upon stimula- 
tion (Figure 2B). 

Of 488 single units, 1 21 responded (Ar > 5 Hz and distinguish- 
able from the response to the standard at a level of p < 0.05, t 
test) to at least one stimulus (Figure S1). No neuron passing 
this test responded significantly to the independent negative 
control (Figure S1). Individual neurons showed a diversity of re- 
sponses: some showed specificity to a single sample; others 
responded to a few strains from the same sex; and yet others re- 
sponded selectively for one or the other sex but were relatively 
insensitive to strain. Examination of the entire population of re- 
corded neurons revealed that functionally similar neurons were 
observed repeatedly across different preparations (Figure 2C). 
Of 121 responsive single units, we manually collected 57 of the 
cells into eight functional classes, based on specificity, sensi- 
tivity, and repeated observation of patterns across multiple sin- 
gle units. Of the remaining (not categorized) units, most were 
weakly responsive, and the rest exhibited idiosyncratic re- 
sponses that did not, on their own, appear to clearly justify for- 
mation of a class (Figure S1). These classes served to select 
particularly clear “exemplar” neurons, which we temporarily as- 
sume to represent the entire class, for the purpose of identifying 
the ligands that drive them. 

Chemical Profiles across Sex and Strain 

These neurophysiological responses are presumably attributable 
to the presence of particular ligands in one or more of these sam- 
ples, each at a specific concentration. If a response is driven by a 
single ligand, its pattern should be predictable from the concen- 
tration of the ligand across samples (Figure 1)— samples contain- 
ing higher concentrations of this ligand should drive more activity. 
We reasoned that the molecular correlates of identity might, 
therefore, be discoverable by seeking compounds whose abun- 
dance could “explain” these patterns of physiological response. 

To quantify the relative abundance of the individual molecular 
components, these same urine extracts were profiled with 
LC-MS/MS. To achieve high sensitivity and specificity, we per- 
formed negative-ion nanoLC-MS/MS using a water/acetonitrile 
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Figure 2. Categorization of VSN Responses 

(A) Electrophysiological recording with a muiti-eiectrode array. The voitage signai recorded from a representative eiectrode (indicated by red coior) is iiiustrated. 

(B) Experimentai design: firing rate versus time of one neuron, across five cycies of stimuius presentation. A coiiection of extracts from maies (M) and femaies (F) of 
four strains, encoded by coior, are presented at three different concentrations (encoded by stimuius bar height). Stimuius order is randomized across cycies, and 
each stimuius is presented once per cycie. Note that coiiecting this number of sensory responses required more than 2.5 hr of continuous recording. 

(C) Overiays of aii neurons assigned to one of eight repeatediy identified response patterns. Singie exempiar neurons are shown in boid coiors, and other neurons 
of simiiar functionai type are shown in faint coiors. Red indicates responses to femaie urine; biue indicates responses to maie; and responses to aii three diiutions 
are shown. The primary differentiator of the first two ciasses is their concentration dependence. Error bars represent firing rate SEM. 

See aiso Figure SI. 



gradient for the LC and high mass resolving power (to enable 
mass accuracy of a few ppm). It is apparent (Figure S2A) that 
these samples show high chemical diversity, consistent with 
the differences observed electrophysiologically. When viewed 
at high mass resolving power, individual constituents were 
readily isolated from one another (Figures S2A and S2B). In 
some cases, multiple compounds with the same mass-to- 
charge (m/z) ratio eluted at different times. We used a combina- 
tion of automated and manual algorithms to segment individual 
peaks (see Experimental Procedures), focusing on those repre- 
senting components with reasonably high ion counts in at least 
one sample. Altogether, we quantified the integrated ion counts 
for components represented by 1 ,634 different peaks with 862 
distinct mlz values. In terms of total ion count, these components 
spanned more than four orders of magnitude in abundance (Fig- 
ure S2C), thus representing a “deep” profile of the individual 
constituents of these samples. 

Molecular Correlates of Neuronal Responses 

The central hypothesis of CAM is that individual compounds, 
present in one or more of the urine samples, might be the source 
of individual VSN responses. To test this hypothesis, we com- 
bined the relative abundance of a single peak across samples 
with the dilution factor during stimulus delivery, thereby gener- 
ating an “effective concentration” of this putative ligand across 
all samples tested electrophysiologically. We then examined 
the extent to which the pattern of concentrations across samples 
could explain a VSN’s response (Figure 1). We fit the responses 
to the Hill equation (Hill, 1938; Arnson and Holy, 2013), quanti- 



fying the fitting error by (see Experimental Procedures). 
Thus, each VSN/component pair yielded a single measuring 
the degree to which a particular ligand might “explain” this neu- 
ron’s responses. Collectively, taking one exemplar from each of 
the eight neuronal classes (Figure 2C) led to 1 3,072 distinct mea- 
sures of goodness of fit. 

An example class 2 (Figure 2C, marked with yellow arrow in 
Figure S1) neuron (shown in Figure 3) exhibited a higher firing 
rate to BALB/c females, even at 1:1,000 dilution, than to any 
other sample (Figure 3A). This implies that any ligand explaining 
this pattern of firing must have at least 1 0-fold higher concentra- 
tion in this sample than in any other sample. Most compounds 
did not fit this pattern. To illustrate, we choose one example, a 
compound giving an ion of m/z = 336.0723 eluting at 34%- 
37% acetonitrile (Figure 3B) and exhibiting relatively little selec- 
tivity across samples. Correspondingly, when the concentration 
of this component, incorporating the dilution factor, was paired 
with firing rate, the resulting pattern of firing across samples 
was a poor fit to this model (Figure 3C, x^ = 352.6) and could 
be confidently excluded (p < 10“^°, 21 degrees of freedom). In 
contrast, this constraint was met for a component giving an ion 
of m/z = 427.1793 eluting at 52%-56% acetonitrile (Figure 3D), 
and the firing pattern was well fit by the Hill equation with x^ = 
26.6 (Figure 3E; p = 0.1 8, meaning that the fit cannot be excluded 
as an explanation of this pattern of firing). 

We performed this analysis for all eight “exemplar” neurons 
(one per class), each with all 1 ,634 components. For the exem- 
plar class 1 neuron (Figure 2C, marked with green arrow in Fig- 
ure S1), the fitting error to all components is shown in Figure 3F. 
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Figure 3. Matching Component Concentrations to Single-Neuron 
Activity Patterns 

(A) Firing rate of a neuron from the second ciass in Figure 2C, marked with a 
yeiiow arrow in Figure SI . 

(B) Reiative abundance across sampies of a component with m/z 336.0723 
eiuting between 34% -37% acetonitriie. 

(C) Incorporation of the abundance with the dilution factor for each stimulus 
leads to a putative dose-response curve as a function of the concentration of 
this particular component. Fitting the points to a Hill model leads to a good- 
ness-of-fit measured by x^- This example represents a very poor fit (p < 1 0“^°). 

(D) Another component with miz 427.1793 eluting between 52% -56% 
acetonitrile is most abundant in BALB/c female urine. 

(E) The dose-response curve from this second component fits well (p = 0.18). 

(F) x^ for a class 1 neuron (Figure 2C, marked with a green arrow in Figure SI) 
for each of 1 ,634 components. Only a single component (marked with a green 
circle), the one in (D) and (E), represents a good fit {x^ < 75). 

See also Figure S2. 



Remarkably, only one gave a plausible fit, namely, the com- 
pound giving an ion of mlz = 427.1793 shown in Figures 3D 
and 3E. This accurate mass is consistent with a compound, 
4-pregnene-1 1 ,20,21 -triol-3-one 21 -sulfate (Figure 3F, marked 
with a green circle), that we previously identified from BALB/c 
female mouse urine (Hsu et al., 2008; Nodari et al., 2008). 

This result is a striking demonstration of CAM’s ability to iden- 
tify plausible candidate ligands among the thousands of com- 
pounds in mouse urine. To analyze all eight classes of physiolog- 
ical responses (Figure 2C), we set a generous cutoff of < 75 
and focused on components represented by 400 peaks with 
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Figure 4. Candidate Ligands for Each Class of VSN Response 

Goodness-of-fit plotted for the 400 most abundant components for each 
neuron shown at right. In each case, at most a handful of components 
represent plausible ligands. For each candidate with x? < 75, m/z and elution 
data are listed in Table SI . Circles highlight a previously identified CAM ligand 
m/z 427. 1 793 (4-pregnene-1 1 ,20,21 -triol-3-one 21 -sulfate, green) and the two 
novel CAM candidates to explain a pan-female response (class 5), m/z 
377.1971 (red) and m/z 361 .2018 (purple). 

See also Figure S3 and Table SI . 



highest ion count, as these were most likely to be reliably quan- 
tified across samples and purified in sufficient amounts for 
further experiments. For 7/8 sensory neuron classes, we ob- 
tained one or more plausible candidate ligands (Figure 4; with 
1 , 5, 9, 0, 2, 3, 4, and 1 2 candidates for classes 1 -8, respectively). 
Altogether, this approach identified 23 unique candidate ligands 
to explain these seven patterns of response. 

The entire list of candidates is presented in Table S1 . These 23 
components of urine represent a first short list of molecular cues 
encoding an individual animal’s “vomeronasal identity.” 

Isolation of CAM Candidate Ligands for “Pan-Female” 
Neuron Response Type 

To test whether CAM successfully identified novel ligands, we 
decided to focus on a class of neuronal response lacking any 
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Figure 5. M377 Excites Female-Selective 
Neurons 

(A) Experimental paradigm for calcium-imaging 
experiments. Each stimulus presentation is marked 
by a color-coded bar, each repeated four times. 
GCaMP2 fluorescence intensity for a particular 
neuron is shown. 

(B) Single optical sections of the whole-mount VNO, 
with the dendritic knob layer near the top of the 
image. AF/F is encoded voxel-wise using a red 
color scale. Female extracts excite more neuronal 
activity than male extracts; M377 potently excites a 
subset of neurons. Scale bar, 50 i^m. 

(C) Overlay of AF/F responses across stimuli. Note 
responses to M377 overlap responses to those 
cells activated by female cues. 

(D) Comparison of response amplitudes to C57BL/ 
6J female extract and purified M377. Each neuron 
sensitive to C57BL/6J female extract is shown as a 
separate circle. Diagonal line represents equal 
response to the two stimuli. 

(E) Heatmap of all urine- or M377-responsive neu- 
rons imaged in two male VNOs. Each cell is repre- 
sented as a single column, and cells with similar 
response patterns were grouped. The first three 
groups correspond to three different types of fe- 
male-selective responses. Note that all neurons 
showing roughly equal responsiveness to both 
BALB/c and C57BL/6J female urine are also 
responsive to M377. 

See also Figure S4. 



known ligands. Among mice, pheromones and other social odor 
cues convey information about sex, social status, and identity. 
We selected the “pan-female” response type (the fifth class in 
Figure 2C), as no VSN ligand for “femaleness” has yet been iden- 
tified. This pattern of neuronal activity had just two likely candi- 
dates, of which the best match was a compound that we prelim- 
inarily denoted as M377; the component elutes at 44%-46% 
acetonitrile and gives by mass spectrometry an anion of m/z 
377.1971 (presumably [M - H]“, marked with a red circle in Fig- 
ure 4) in the negative-ion mode. Direct inspection of the nanoLC- 
MS/MS profiles confirmed that this compound was found specif- 
ically in female mouse urine (Figure S3). This class of neurons has 
just one other CAM candidate shown in Figure 4, a compound we 
called M361, eluting at 54%-58% acetonitrile with a miz 
361.2018 (marked with a purple circle in Figure 4). 

The LC data indicated that these two compounds co-elute 
with many others, but that these contaminants are less severe 
in samples from C57BL/6J females. Therefore, we chose this 
strain for further purification. We fractionated 100 ml of C57BL/ 
6J female mouse urine extract by high-performance liquid chro- 
matography (HPLC) (see Experimental Procedures), using mass 
spectrometry to identify fractions containing ions of the antici- 
pated mIz 377 and 361. The corresponding HPLC fractions 
were further purified by thin-layer chromatography (TLC) to 
obtain a sample of high purity. 



M377 Accounts for Approximately One-Fourth of the 
Neuronal Response to Urine from C57BL/6J Females 

To determine whether M377 is the source of activity for all “class 
5” neurons (Figure 4) as predicted by CAM, we used OCPI/light- 
sheet fluorescence microscopy (Holekamp et al., 2008) and cal- 
cium imaging to monitor neuronal activity in VSNs. Using male 
mice expressing GCaMP2 in VSNs (He et al., 2008), we recorded 
responses in the intact whole-mount preparation (Turaga and 
Holy, 2012; Xu and Holy, 2013). We presented urine extracts 
from BALB/c and C57BL/6J males and females, as well as puri- 
fied M377 matching its endogenous concentration in C57BL/6J 
female urine extract. These stimuli were presented in four 
repeated trials, randomizing the order of stimuli within each cycle 
(Figure 5A). We recorded from two epithelial imaging volumes, 
containing an estimated 20,000 vomeronasal neurons (Turaga 
and Holy, 2012). 

We identified 1 ,463 cells as responding robustly to at least one 
of the urine extracts. The largest numbers of neurons were acti- 
vated by urine from females, consistent with electrophysiological 
recordings (Figure SI). Individual neurons were largely sex spe- 
cific and displayed a range of strain selectivity. We also observed 
130 neurons responsive to purified M377, demonstrating that 
this compound indeed activates a subset of neurons in the vom- 
eronasal organ (VNO). Moreover, neurons responsive to M377 
were also invariably responsive to urine from C57BL/6J and 
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BALB/c females, but did not respond to urine from males (Fig- 
ures 5B, 5C, and 5E). 

To investigate quantitatively the relationship between re- 
sponses to M377 and mouse urine, for each M377-responsive 
neuron, we compared the intensity of its AFIF to both M377 
and C57BL/6J female mouse urine. The response amplitudes 
were closely matched over the entire range of response inten- 
sities (Figure 5D), indicating a 1:1 relationship between these 
neurons’ responses to C57BL/6J female mouse urine and puri- 
fied M377. Consequently, we conclude that these neurons’ urine 
responses are caused by M377. 

The characteristics of the M377-responsive neurons shown in 
Figure 5E— approximate equal sensitivity to BALB/c and 
C57BL6 female urine— are consistent with class 5 neurons (Fig- 
ures 2C and 4). Two other types of female-responsive neurons, 
labeled types 2 and 3 (Figure 5E), are consistent with classes 
7-8 and 1-3 (Figure 4), respectively; these neurons were unre- 
sponsive to M377. 

Structural Characterization of M377 

Based on accurate mass measurement, M377 is ionized to an 
[M - H]“ chemical formula of C2iH2g06“, indicating that it is a 
novel ligand. Following determination of its chemical formula, we 
solved its structure by using a combination of MS-based methods, 
including high mass resolving power, tandem MS (MS^ and MS^), 
HD exchange (to count exchangeable hydrogens), and a chemical 
reaction with periodic acid, followed by MS detection. The details 
of structure elucidation are provided in Supplemental Experi- 
mental Procedures and Figures S4, S5, S6 and Table S2. As a 
result of these analyses, the proposed structure of M377 is 1 6-hy- 
droxycorticosterone 20-hydroxy-21 -acid. 

To test the validity of this structural interpretation, we commis- 
sioned a custom synthesis of this proposed compound, as an 
epimeric mixture at carbon 20. Urine-derived M377 and synthetic 
16-hydroxycorticosterone-20-hydroxy-21-acid were indistin- 
guishable by thin-layer chromatography (TLC) and reversed- 
phase HPLC (Figure S4A). Further, product-ion mass spectra of 
the synthetic and endogenous compounds were nearly indistin- 
guishable (Figures 6A and 6B), indicating identical structures of 
the two. Fragmentations by MS^ of major MS^ fragment ions 
from both compounds are also consistent (data not shown). 

We then asked whether synthetic M377 (16-hydroxycorticos- 
terone 20-hydroxy-21-acid) showed the same neuronal activity 
as endogenous M377. The two stereoisomeric components of 
the synthetic epimeric mixture were quantified by HPLC as hav- 
ing a concentration ratio of 10:1, of which the less-abundant 
component matched the endogenous compound in elution 
time (Figure S4A). Synthetic 16-hydroxycorticosterone-20-hy- 
droxy-21-acid and endogenous M377 were presented at three 
different concentrations (1 , 0.1 , and 0.01 |iM) in four repeated tri- 
als, randomizing the order of stimuli within each cycle. VSNs 
responding to the synthetic compound also responded to 
endogenous M377 with identical concentration dependence 
(Figure 6C). These results conclusively demonstrate that M377 
is 1 6-hydroxycorticosterone-20-hydroxy-21 -acid. 

This structure has several distinctive features. To our knowl- 
edge, biological activity of a steroid carboxylic acid has not 
been previously reported. As a 21 -carbon steroid with a hydroxyl 



group at the 1 1 position, this structure likely derives from meta- 
bolism of a glucocorticoid (Wang et al., 2010). However, the hy- 
droxyl group at position 16 is a motif characteristic of a female 
sex steroid, estriol, which in humans is only produced in signifi- 
cant amounts during pregnancy (Raju et al., 1990). We therefore 
named this new VSN ligand “cortigynic acid.” 

Structural Characterization of M361 

Based on accurate mass measurement, M361 gives an [M - H]“ 
chemical formula of C 2 iH 2 g 05 “, indicating that it is a novel 
ligand. By comparing its chemical formula with that of cortigynic 
acid (C 2 iH 2 g 06 “), we deduced that this new unknown com- 
pound might be an immediate metabolic precursor of cortigynic 
acid. We hypothesized that the distinctive C-16 hydroxyl group 
of M377 might be added as the last step, and therefore, we pro- 
posed that M361 is 20-dihydro-corticosterone-21 -carboxylic 
acid. To test our proposed structure for M361 , we also commis- 
sioned a custom synthesis of this second compound. 
Urine-derived M361 and synthetic 20-dihydro-corticosterone- 
21 -carboxylic acid were indistinguishable by thin-layer chroma- 
tography (TLC) (data not shown) and reversed-phase HPLC (Fig- 
ure S4B). MS/MS analysis demonstrated that the product-ion 
mass spectra of the synthetic and endogenous compounds 
are identical (Figures 6E and 6F). These results suggest that 
M361 is 20-dihydro-corticosterone-21 -carboxylic acid, which 
we will also refer to as corticosteronic acid. 

Combinatorial Coding of Female-Specific Ligands 

To learn more about how these ligands are encoded by VSNs, we 
examined responses to a stimulus battery that included corti- 
gynic acid and corticosteronic acid, together with 100-fold 
diluted urine samples from all four strains (BALB/c, DBA, 
C57BL/6J, and CBA) of females. Cortigynic acid and corticoster- 
onic acid were presented individually at their endogenous con- 
centrations in whole and in undiluted urine and in mixtures of 
the two spanning two orders of magnitude in concentration, 
the lowest of these approximately matching their concentrations 
in the diluted urine extract samples. Using male mice expressing 
GCaMP3 in VSNs, we recorded responses in the intact whole- 
mount preparation (Turaga and Holy, 2012; Xu and Holy, 2013) 
by OCPI microscopy (Holekamp et al., 2008). 

From three epithelial imaging volumes, we identified 236 cells 
responding robustly to either cortigynic acid or corticosteronic 
acid. Using a clustering algorithm, we identified four types of re- 
sponses. The first type was highly responsive to all four strains of 
females (the signature of class 5 neurons; Figures 2C and 4) and 
exquisitely sensitive to both cortigynic acid (M377) and cortico- 
steronic acid (M361), exhibiting responses at sub-micromolar 
concentrations as present in the diluted urine samples. With 
the exhaustive sampling of OCPI microscopy and expanded 
range of stimulus concentrations, we also identified three other 
clusters responsive to at least one of these compounds and a 
subset of the mouse strains. One of these was strongly activated 
by BALB/c, DBA, and C57BL6, and perhaps weakly by CBA fe- 
males; these neurons were responsive to high concentrations of 
M377 and M361 , but only weakly responsive at the lowest con- 
centration. As CBA females have the lowest endogenous con- 
centration of M377 (by a factor of approximately 2; see 
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Figure 6. Structural Validation of M377 and M361 

(A) Proposed chemical structure of purified M377 and its MS^ product-ion spectrum. 

(B) Structure of synthetic cortigynic acid and its MS^ spectrum. 

(C) Calcium imaging to purified and synthetic M377 over a range of matched concentrations. 

(D) Response amplitude of each neuron to purified and synthetic M377, pooled across all concentrations. The average response across four trials by each neuron 
to each matched-concentration stimulus combination is shown as a circle. 

(E and F) Chemical structure of purified and synthetic M361 and the MS^ product-ion spectrum. 

(G) Combinatorial coding of female-specific cues. Neuronal responses to 100-fold dilute urine stimuli, M377, M361, and their mixture at three different 
concentrations. 

See also Figures S5 and S6 and Table S2. 



Figure S3), it seems likely that the partial strain-selectivity of this 
class of neurons stems from a threshold effect at the tested con- 
centrations. The two other clusters responded to a subset of 
strains and were not activated by M377, but were activated by 
M361 . Consequently, multiple classes of VSNs (likely expressing 
different vomeronasal receptor genes) participate in a combina- 
torial code for the presence and concentration of these two li- 
gands (Figure 6G). 

Cortigynic Acid Is Specific to Sexually Intact Adult 
Female Mice 

Cortigynic acid is expressed in urine of all tested adult female 
strains at a concentration near 1 |iM and is essentially absent 



from the urine of adult males (Figure S3). To further explore the 
factors that control its expression, we collected urine from estrus 
and non-estrus females, ovariectomized females, and juvenile 
males and females (see Experimental Procedures). Quantitative 
LC-MS/MS of these samples showed that cortigynic acid was 7- 
fold less-concentrated in the urine of P21 juvenile females and 
more than 30-fold lower in the urine of ovariectomized females 
(data not shown). There was no cortigynic acid detected in urine 
from adrenalectomized males and females (data not shown), 
indicating that its synthesis is affected by both the gonads and 
the hypothalamic/pituitary/adrenal axis. Somewhat surprisingly, 
the concentration of cortigynic acid was only modestly (at most 
~30%) modulated by the estrous cycle. To determine whether 
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Figure 7. Male Investigatory and Mounting Behavior 

(A and B) Schematic of the experiment. Male “sniff time” to stimuli presented on a cotton swab is measured by an optical beam-break detector; male mounting 
behavior was scored manually. 

(C) Mean and SEM of the fraction of time spent investigating the swab after initial contact. Males spent longer investigating the sample doped with M377, M377 + 
M361 , female extract, or female urine (*p < 0.001 , **p < 0.00001 , n = 1 1 , t test). 

(D) Removing M377 from female urine extract significantly reduced wild-type males’ investigation time (*p < 0.01 , n = 12, t test). 

(E) Adding M377 back to the “deficient” female urine extract robustly increased male’s investigation time (*p < 0.05, n = 12, t test). 

(F) Duration of mounting toward females painted with vehicle (light blue) and M377 (salmon) (*p < 0.05, n = 12, Mann-Whitney test). 

(G) Duration of mounting toward females painted with vehicle (light blue) and M361 (lime). 

(H) Duration of mounting toward females painted with vehicle (light blue) and M361 + M377 (pink) (*p < 0.02, n = 17, Mann-Whitney test). 

(I) Frequency of mounting toward females painted with vehicle (light blue) and M361 + M377 (pink) (*p < 0.02, n = 17, Mann-Whitney test). 

(J) Mounting duration of individual males toward females painted with vehicle (light blue) and M377 + M361 (pink). 

(K) Mounting frequency of individual males toward females painted with vehicle (light blue) and M377 + M361 (pink). 

See also Figure S7. 



cortigynic acid is species specific, we also examined hamster 
and rabbit urine from both sexes. No cortigynic acid was de- 
tected in these two species, suggesting that cortigynic acid is 
a female-specific cue for mice. 

Cortigynic Acid Increases Investigatory Behavior 
of Male Mice 

Male mice investigate urine from females significantly longer 
than urine from males, in a manner that depends on direct phys- 



ical contact and an intact vomeronasal system (Pankevich et al., 
2004). This behavior depends on a positive cue in female urine, 
rather than a negative cue in male mouse urine: mixtures of the 
two stimuli cause similar investigatory behavior to that triggered 
by female urine alone, when presented at matching concentra- 
tion (Guo and Holy, 2007). 

We used an optical beam-break detector to record bouts of 
voluntary investigation by untrained, freely behaving male mice 
(Figure 7A). Because cortigynic acid, corticosteronic acid, and 
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female urine extract are not volatile and therefore attract little 
notice by themselves, we separately added them to whole 
male mouse urine. We compared investigatory behavior to a 
swab with just male mouse urine against one with male mouse 
urine doped with 1 0 |iM cortigynic acid, cortigynic acid + cortico- 
steronic acid, female urine extract, and female urine. Each male 
was presented with each swab, plus a blank in between, in ses- 
sions of 120 s, spaced over a period of approximately 1 hr. The 
order of presentation of the stimuli was counterbalanced across 
mice, and the experimenter was blinded to the identity of the 
stimuli. 

Males investigated female mouse urine for durations that were 
many times longer than exhibited for male mouse urine. Extracts 
of female-mouse urine were not significantly different (p > 0.1) 
from whole female mouse urine, demonstrating that their partic- 
ular interest in this stimulus is largely triggered by the non-volatile 
components (Figure 7C; **p < 0.00001 , n = 1 1). 

The addition of cortigynic acid alone to male mouse urine also 
triggered a large increase in behavioral investigation (Figure 7C; 
*p < 0.001 , n = 1 1). The addition of corticosteronic acid to corti- 
gynic acid did not significantly increase the male investigation 
time compared with that caused by M377 on its own (Figure 7C; 
p > 0.1, n = 11). To determine whether this effect is simply a 
consequence of novelty, we also tested whether a structurally 
similar compound 1 ,4-pregnandien-1 1 p, 17, 20-triol-3-one-21- 
carboxylic acid (P0750) exhibited a similar effect. Using the 
same testing procedure, we found no difference between male 
urine and male urine doped with P0750 (Figure S7C; p > 0.7, 
n = 1 0), indicating that increased investigation is not a general re- 
action to any steroid carboxylic acid. 

Transient receptor potential channels (TRPC2) play a funda- 
mental role in detecting pheromonal and other chemical social 
recognition signals (Stowers et al.,2002; Leypold et al., 2002; 
Omura and Mombaerts, 2014). In TrpC2~'~ mice, cortigynic 
acid did not increase the investigation time (Figure S7D). 

To complement the gain of function observed when adding 
cortigynic acid to male mouse urine, we asked whether corti- 
gynic acid is necessary by removing it from female mouse urine. 
Female urine extract was separated by HPLC using a methanol/ 
water gradient and then recombined, omitting the two fractions 
(out of 100) containing M377 (Figure S7B). In behavioral testing, 
male mice significantly decreased their investigation of the “defi- 
cient” female urine extract compared to the whole female urine 
extract (Figure 7D; *p < 0.01, n = 12). Adding cortigynic acid 
back to the “deficient” female urine extract significantly 
increased male investigation time (Figure 7E; *p < 0.05, n = 12). 
These results demonstrate that cortigynic acid plays a major 
and specific role in attracting male investigation. 

Cortigynic Acid and Corticosteronic Acid Are Sufficient 
to Promote Mounting Behavior 

Classic experiments demonstrated that female mouse urine 
contains one or more pheromones that stimulate male mounting 
behavior (Dixon and Mackintosh, 1975; Haga-Yamanaka et al., 
2014); however, the chemical identity of the cue(s) has never 
been established. To determine whether cortigynic acid and cor- 
ticosteronic acid might be the missing cue(s), we painted the 
anogenital area of ovariectomized females with these com- 



pounds and exposed them to sexually naive males. Cortigynic 
acid alone significantly increased the total amount of time spent 
mounting (Figure 7F; p < 0.05, n = 1 2), but exhibited only a trend- 
level increase on the total number of mounting episodes (data 
not shown). In contrast, females painted with corticosteronic 
acid alone did not undergo significantly more mounting than 
those painted with water (Figure 7G; p > 0.05, n = 12). Interest- 
ingly, the combination of cortigynic acid and corticosteronic 
acid induced the most robust mounting activity (Figures 7H- 
7K; p < 0.02, n = 17). Thus, individually cortigynic acid is suffi- 
cient to induce mounting, but it exhibits a synergistic effect 
with corticosteronic acid. 

DISCUSSION 

To elucidate the vomeronasal code for identity, we developed 
CAM to perform a broad screen for ligands that “explain” the 
response of VSNs. This strategy exploits natural diversity to 
identify many candidate ligands from a single round of bioassays 
and is intrinsically parallelizable to multiple sensory neuron 
types. CAM’s only requirements— functionally diverse samples 
and a richly quantifiable bioassay— may be met in other studies, 
making the technique broadly applicable. 

Without requiring any purification, CAM identified a single 
compound, out of 1,634 possibilities, to explain responses of a 
neuronal type selective for urine from BALB/c females (Figure 3). 
This single compound had been previously identified from BALB/ 
c female-mouse urine (Nodari et al., 2008), and this finding 
demonstrated that CAM can identify a small handful of plausible 
ligands from hundreds or thousands of candidates. We orga- 
nized neuronal responses into eight reproducibly identified clas- 
ses, of which seven had one or more CAM candidates. Explain- 
ing these responses would seemingly require a minimum of 
seven distinct ligands; our statistical analysis selected a set of 
23 plausible candidates. It is also worth noting that some of these 
23 candidate ligands apply to more than one class of neuron (Ta- 
ble S1). Indeed, our detailed analysis of the class 5 (Figures 2C 
and S1) candidate ligands (M377 and M361) demonstrated 
that four distinct classes of neurons responded to these two li- 
gands. These results suggest that vomeronasal coding may be 
organized around many receptor genes that are tuned with 
differing specificities and sensitivities to a much smaller number 
of behaviorally relevant ligands. This interpretation is consistent 
with the much larger number of receptor genes (~300) than 
candidate ligands (23) in urine. Moreover, the structural similarity 
of cortigynic acid and corticosteronic acid, like that found in an 
earlier ligand discovery (Nodari et al., 2008), suggests an 
emerging general pattern: that the vomeronasal system detects 
multiple intermediates in the same metabolic pathway, using 
multiple receptor genes as a combinatorial code (Figure 6G) to 
provide robust information about concentration and identity 
(Arnson and Holy, 2013). 

Neuronal responses to mouse urine were previously demon- 
strated to exhibit sex selectivity (He et al., 2008; Holy et al., 
2000; Tolokh et al., 2013). However, no urinary ligand exhibiting 
a pattern of sex-, but not strain-, specific distribution had yet 
been reported. Cortigynic acid and corticosteronic acid are the 
first potential cues for encoding “sex” from urinary cues for the 
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mouse vomeronasal system. Cortigynic acid was present at high 
concentration only in intact adult female mice urine and was rela- 
tively constant across strains (Figure S3). 

In behavioral tests, cortigynic acid substantially and signifi- 
cantly increased male investigatory and mounting behavior. 
When added to male urine, this one ligand increased the duration 
of investigation bouts by more than 2-fold. Conversely, removing 
cortigynic acid from female urine extract dramatically reduced 
male investigatory time (Figure 7D). Most remarkably, painting 
ovariectomized females with M377, alone or especially in 
conjunction with M361, significantly increased male mounting 
behavior. Based on results with a close structural analog, we 
speculate that the specific interest in this compound integrates 
multiple structural features, including the 16-hydroxyl functional 
group specific to female mice. 

Previous results (Dixon and Mackintosh, 1975; Haga-Yama- 
naka et al., 2014) demonstrated that estrous urine, but not dies- 
trous urine, was effective in promoting mounting behavior 
toward ovariectomized females. Here, we observed a robust in- 
crease in mounting with compounds whose concentration was 
at most modestly (~30%) influenced by the estrous cycle. There 
appear to be several possible interpretations of this result. First, 
it is possible that even a modest modulation of concentration is 
sufficient to lead to estrous-dependent behavior. Second, it is 
possible that estrous is signaled by the presence of cortigynic/ 
corticosteronic acid together with the absence of one or more 
(currently unknown) diestrous-specific cue(s). In such a scenario, 
the presence of cortigynic/corticosteronic acid would mimic 
estrous even though these compounds do not exhibit dramatic 
cycling. Third, to ensure that we would not miss a positive effect, 
in mounting assays cortigynic acid and corticosteronic acid were 
used at approximately 10-fold excess concentration compared 
to their natural concentrations; it is possible that these ligands 
cross-react to a receptor that is more sensitively tuned to a 
different, estrous-specific ligand. The conditions under which 
our urine samples were collected, with five animals per cage, 
were not optimal for studying the role of estrous; we anticipate 
that this will be an important topic for future studies. 

Together with the identification of receptors, the identification 
of VSN ligands establishes a molecular foundation for under- 
standing how sensory inputs are used in recognition and 
behavior. Although receptors tend to fall into gene families and 
can be exhaustively mapped via genome sequencing and ho- 
mology analysis, organic ligands are diverse and have heretofore 
been discovered idiosyncratically. In this study, we developed a 
robust method that enabled us to systematically pick out plau- 
sible ligands for vomeronasal neurons. These results take a ma- 
jor step toward a comprehensive understanding of the molecular 
code for identity in the vomeronasal system. 

EXPERIMENTAL PROCEDURES 
Mice 

Male B6D2F1 mice 8-12 weeks of age were used in all electrophysiology ex- 
periments. tetO-GCaMP2/OMP-IRES-tTA mice (He et al., 2008) and 
AI38(GCaMP3)/OMP-Cre male mice aged 8-12 weeks were used for imaging 
by objective-coupled planar illumination (OCPI) microscopy (Holekamp et al., 
2008). Prior to dissection, mice were anesthetized with carbon dioxide and 
decapitated. All procedures were approved by the Washington University 



Animal Studies Committee. Sexually naive 3-month-old male B6D2F1 mice 
were used to examine mounting behaviors. Overiectomization of 2-month- 
old C57BL/6J females was performed by a veterinary technician from the 
Department of Comparative Medicine at Washington University in St. Louis. 

Electrophysiological Recording and Calcium Imaging 

Dissection and recording procedures were performed as previously described 
(Arnson et al., 2010; Turaga and Holy, 2012; Xu and Holy, 2013); briefly, intact 
B6D2F1 male vomeronasal epithelia were isolated, mounted on a multi-elec- 
trode array for electrophysiological recording, and adhered onto a nitrocellu- 
lose membrane (0.45 |im; Millipore) for calcium imaging. Before recording, 
the VNO was acclimated by superfusing with Ringer’s for at least 30 min. 
The order of stimuli, together with Ringer’s control, was randomized across 
multiple repeated trials. Further details about stimulus preparation, recording 
method, and data analysis can be found in Supplemental Experimental 
Procedures. 

Component-Activity Matching, Purification, Structurai 
Identification, and Removai of M377 from Urine Extracts 

See Supplemental Experimental Procedures. 

Investigation Behavior 

Male mouse chemosensory investigation was recorded as previously 
described (Guo and Holy, 2007; Holy and Guo, 2005). Periods during which 
the swab was investigated were extracted algorithmically from the voltage 
signal from an optical beam-break detector. Significance was assessed by a 
paired t test. Further detail is presented in Supplemental Experimental 
Procedures. 

Mounting Behavior 

Individual B6D2F1 males were singly housed for at least 2 weeks without 
bedding change prior to the mounting test. Assays were performed in their 
home cages. Videos were scored manually using Interact Mangold software 
(Mangold International). Statistical significance was assessed by the Mann- 
Whitney test. Further detail is presented in Supplemental Experimental 
Procedures. 

SUPPLEMENTAL INFORMATION 

Supplemental Information includes Supplemental Experimental Procedures, 
seven figures, and two tables and can be found with this article online at 
http://dx.d 0 i. 0 rg/l 0.101 6/j.cell.201 5.09.01 2. 
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SUMMARY 

Neurodegenerative diseases have been linked to 
inflammation, but whether altered immunomodula- 
tion plays a causative role in neurodegeneration 
is not clear. We show that lack of cytokine inter- 
feron-3 (IFN-3) signaling causes spontaneous neu- 
rodegeneration in the absence of neurodegenerative 
disease-causing mutant proteins. Mice lacking Ifnb 
function exhibited motor and cognitive learning im- 
pairments with accompanying a-synuclein-contain- 
ing Lewy bodies in the brain, as well as a reduction 
in dopaminergic neurons and defective dopamine 
signaling in the nigrostriatal region. Lack of IFN-3 
signaling caused defects in neuronal autophagy 
prior to a-synucleinopathy, which was associated 
with accumulation of senescent mitochondria. Re- 
combinant IFN-3 promoted neurite growth and 
branching, autophagy flux, and a-synuclein degra- 
dation in neurons. In addition, lentiviral IFN-3 over- 
expression prevented dopaminergic neuron loss in 
a familial Parkinson’s disease model. These results 
indicate a protective role for IFN-3 in neuronal ho- 
meostasis and validate Ifnb mutant mice as a model 
for sporadic Lewy body and Parkinson’s disease 
dementia. 

INTRODUCTION 

Neurodegenerative diseases have disrupted neuronal homeo- 
stasis and their pathologies often overlap. Protein aggregates 
containing a-synuclein (a-syn), which eventually forms larger 
Lewy bodies (LBs), are seen in Parkinson’s disease (PD), demen- 
tia with Lewy bodies (DLB), multiple system atrophy, and in some 
forms of Alzheimer’s disease (AD), all neurodegenerative dis- 
eases associated with aging (Arima et al., 1999; Francis, 2009; 
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Lippa et al., 1998). Often, the protein aggregates contain hyper- 
phosphorylated tau and ubiquitin (Jellinger and Attems, 2008). 

Neurodegenerative events in these diseases are linked to 
inflammation (Mrak and Griffin, 2007; Tansey et al., 2008), but, 
despite this link, defects in genes regulating inflammation do 
not have an established causative role in neurodegeneration. 
We report that deletion of Ifnb, which encodes interferon-p 
(IFN-p), is sufficient to cause a cascade of neurodegenerative 
events. IFN-p belongs to the type I interferon family of cytokines 
and binding to its receptor, interferon-a/p receptor (IFNAR), re- 
sults in immunoregulation including anti-viral and anti-inflamma- 
tory effects (Prinz et al., 2008; Teige et al., 2003, 2006) and 
benefits for multiple sclerosis patients (Liu et al., 2014; Yong 
et al., 1 998). However, the role of IFN-p in classic neurodegener- 
ative diseases is unknown. We report that lfnb~'~ mice developed 
spontaneous pathologies mimicking major aspects of human 
neurodegeneration such as PD and DLB. Ifnb~^~ mice had age- 
associated motor learning defects, neuromuscular deficiencies, 
and cognitive impairment. Ifnb~^~ pathology was associated with 
LBs resulting from defective neuronal autophagy. Autophagy, a 
pathway that degrades long-lived proteins, organelles, lipids, 
and protein aggregates, is essential for neuronal homeostasis 
(Harris and Rubinsztein, 2012), and deleting neural autophagy- 
regulating genes leads to neurodegeneration (Hara et al., 2006; 
Komatsu et al., 2006). 

Our findings indicate a central role for IFN-p in neuronal ho- 
meostasis as a regulator of autophagy-mediated protein 
degradation and accentuates lfnb~^~ mice as a model for neuro- 
degenerative diseases with a-synucleinopathy and dementia 
such as PD and DLB. 

RESULTS 

lfnb~^~ Mice Exhibit Behavioral and Cognitive 
Impairments and Neurodegeneration 

We examined the effect of Ifnb gene deletion on motor coordina- 
tion, learning, and grip strength. !fnb~'~ mice were significantly 
impaired in motor coordination and learning from 3 months 

CrossMark 
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Figure 1. lfnb~'~ Mice Exhibit Spontaneous Behavioral and Cognitive Impairments and Neurodegeneration 

(A) Motor coordination by RotaRod. Data are mean second(s) ± SEM, n = 8-13/group. *p < 0.05, **p < 0.01 for genotype effect per time point by unpaired 
Student’s t test; ^p < 0.05 for aging effect. “®^^p < 0.001 and ^§p < 0.01 for genotype and aging effect, respectiveiy, over time, by two-way ANOVA. 

(B) Wire suspension performance. Data are mean ± SEM, 9-13/group. 

(C and D) (C) Coid and (D) heat taii-fiick iatency. Data are mean ± SEM, 9-1 1/group. 

(B-D) Unpaired Student’s t test. *p < 0.05, **p < 0.01 , ***p < 0.001 . 

(E) Morris water maze test. Data are mean ± SEM, 8-21 mice/group. ***p < 0.001 shows genotype effect; ^p < 0.05 shows age effect; *p < 0.05 comparing 
genotypes by two-way ANOVA and (*) with Bonferroni post hoc test. 

(F) Representative maze swimming pattern on probes 1 and 2. 

(G) TUNEL staining of hippocampus from 1 .5-month-oid mice with methyi green nuciear counterstaining. MoDG, moiecuiar dentate gyrus; GrDG, granuiar 
dentate gyrus; aiv, aiveus. Scaie bar, 200 ^im. 

(H) H&E brain staining showing ioss of granuiar ceii iayer and Purkinje ceiis (arrows) in lfnb~'~ hippocampus (upper) and cerebeiium (iower). Scaie bar, 200 ^im. 

(legend continued on next page) 
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compared to age-, sex-, and weight-matched wild-type (WT) 
littermates and in latency-to-fall time in a wire-suspension 
test (Figures 1 A and 1 B). We saw no differences in 1 .5-month-old 
mice, suggesting that impaired motor coordination, balance, 
and grip strength were associated with age. During repeated 
motor-learning trials, retention time improved significantly in 
older mice compared to lfnb~'~ mice from age 3 months 

(Figure S1A), showing that reduced motor-learning in lfnb~^~ 
mice was associated with aging. 

We assessed somatosensory function with nociception cold- 
and heat-induced tail-flick tests. Latency to tail flick was signifi- 
cantly shorter in lfnb~^~ than Ifnb'^^'^ mice (Figures 1 C and 1 D), 
indicating hyperalgesia and defective nociception toward tem- 
perature-induced pain. 

Forced swimming tests found no differences between lfnb~^~ 
and Ifnb'^^'^ mice in swimming pattern, climbing effort, or immo- 
bility (Figure S1), so lfnb~^~ mice were not defective in locomotor 
activity in water in contrast to land. In water maze tests, lfnb~^~ 
mice had significant spatial- and memory-learning deficits that 
increased with age. During second tests, 6- and 12-month-old, 
lfnb~^~ mice had significantly fewer platform position crossings 
compared to Ifnb^'^ mice, indicating impaired reference memory 
(Figures 1 E and 1 F). Ifnb'^^'^ mice improved between the first and 
second tests at all ages except 12 months; lfnb~'~ mice did not 
improve, indicating impaired spatial learning. This was seen at 
all ages in lfnb~^~ mice measured by escape latency time in the 
second learning block (Figures SIB and SIC). Thus, although 
lfnb~'~ mice began with no major behavioral and cognitive de- 
fects, they developed age-dependent deficits with increased 
penetrance (Table SI). 

IFN-p Is Essential for Neuronal Survival, Neurite 
Outgrowth, and Branching 

We investigated whether lfnb~'~ behavioral deficits were associ- 
ated with neurodegeneration. Apoptotic cells were detected in 
1.5-month-old lfnb~^~ granular layers of olfactory bulbs (Fig- 
ure S1E), the granular dentate gyrus of hippocampus and the 
subventricular zone (Figure 1G), and the striatum (STR) caudate 
putamen including the ependymal cell layer in 12-month-old 
lfnb~'~ mice (Figure SI F), but not detected in Ifnb^'^ sagittal brain 
sections at similar ages. Apoptotic neurons increased in cultured 
ifnb~'~ primary cerebellar granular neurons (CGNs) and was 
reversed by recombinant (r)IFN-p (Figures S1G and S1H). 
Neurons were significantly reduced in the hippocampal CA1 re- 
gion in 3- to 6-month-old lfnb~'~ mice and decreased in Purkinje 
cells of cerebellum. Glial cell counts were unchanged (Figures 
1H and II). 

Supporting in vivo deficits in neuronal circuits, lfnb~'~ mice had 
reduced neurite network formation in the cerebellum, frontal cor- 
tex, and hippocampus granular cell layer (Figure 1J). This was 
confirmed in cultured lfnb~'~ primary cortical neurons (CNs) 
that had reduced neurite length per cell, mean length, and 



branch length per cell after 4 days, with restoration by rIFN-p 
(Figures 1 K and 1 L). A similar trend was seen after 21 days of cul- 
ture (Figure IK). Cultured primary lfnb~^~ CGNs mimicked the 
findings of CNs (Figures SI I and S1J). Thus, behavioral defects 
in old lfnb~^~ mice were associated with neuron death and 
reduced neurite circuits. 

Neuron Gene Profiling Reveals 
Neurodegenerative Paths 

We identified neuron-specific signaling and disease pathways 
caused by IFN-p deficiency with expression microarrays on 
highly pure (>98%) primary CGNs from lfnb~^~ and 
mice plus or minus rIFN-p. lfnb~'~ neurons had 323 upregulated 
and 233 downregulated genes (Figures 2A, 2B, and S2A). 
rIFN-p treatment had differential effects on Ifnb'^^'^ versus 
lfnb~'~ CGNs with approximately equal numbers of downregu- 
lated and upregulated genes, but with relatively few overlaps 
(Figures 2C-2F). Multiple genes in the lfnb~^~ data set annotated 
as neuronal degeneration and the top ten pathways included 
cell death and neurological disorders. Genes in neurite forma- 
tion and branching suggested compromised neurogenesis 
pathways in lfnb~'~ neurons, supporting the in vitro and in vivo 
phenotypes (Figures 2G and 2H). To investigate whether Ifnb 
and signaling via IFNAR affected neurogenesis in vivo, 
3-month-old lfnb~^~ and IFNAR knockout (KO) mice (lfnar~^~) 
and WT littermates were injected with bromodeoxyuridine 
(BrdU). Lack of Ifnb or receptor was associated with reduced 
neurogenesis in the hippocampus; dentate gyrus (Figure 21), 
supporting the microarray data. IFN-p might directly affect 
prosurvival mechanisms as neurotrophin was among the top 
hits in gene set enrichment analysis (GSEA) (Table S2) and 
mRNA for the corticogenesis-regulating transcription factor 
Tox (Artegiani et al., 2015) was reduced in lfnb~^~ neurons (Fig- 
ure S2A). Increased apoptosis and reduced neurogenesis 
might contribute to impaired cognition and reduced hippocam- 
pal neuron numbers. GSEA of Ifnb'^^'^ and lfnb~^~ showed 
enrichment of genes associated with Huntington’s disease 
(HD), PD, AD, and prion diseases, and rIFN-p caused the 
lfnb~^~ profile to resemble WT (Figure 2J), supporting that com- 
mon pathways link neurodegenerative diseases (Shulman and 
De Jager, 2009). 

We compared the lfnb~^~ neuronal gene profile with PD 
and HD mouse models (Fossale et al., 2011). Three sets of 
differentially expressed genes were generated: differentially 
expressed in our data (p < 0.001), in our data and the HD 
model (p < 0.01), or our data and the PD model (p < 0.01) 
(Figures S2A-S2C; Data SI). Clustered genes in lfnb~'~ 
neurons and the HD model were both similarly and oppositely 
regulated. In contrast, most clustered genes in lfnb~'~ neurons 
and the PD model were regulated similarly, thus indicating a 
higher gene signature resemblance with PD than HD (Figures 
S2B and S2C). 



(I) Number of neurons and glial cells in hippocampus of 3- to 6-month-old mice. Data are mean ± SEM, n = 6 mice/group. *p < 0.05 using Student’s t test. 

(J) IF of 3-month-old brain sections. Scale bars, 10 i^m. 

(K) Z stack projections of CNs cultured for 4 and 21 days with or without rIFN-p. Scale bar, 50 |am. 

(L) Quantified neurite length from 4-day-old CNs. 

Data are mean ± SEM, n = 3-4, counting 11-32 cells per experiment. *p < 0.05, **p < 0.01 , ***p < 0.001 by one-way ANOVA. See also Figure SI and Table SI . 
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Lack of Ifnb Causes Defects in the Nigrostriatal 
Dopaminergic Pathway 

Based on the possible PD-related defect in lfnb~^~ neurons, we 
investigated nigrostriatal region integrity. Tyrosine hydroxylase 
(TH)'^ fiber density and TH"^ (dopamine-producing) neurons 
were significantly reduced in the STR and substantia nigra (SN) 
in lfnb~'~ mice versus WT (Figures 2K and 2L). NeuN"^ and 
NeuN'^TH^ cells were reduced in the ventral midbrain, which 
was correlated with reduced TH protein in basal ganglia (BG) in 
lfnb~'~ mice while total cells were unaffected (Figures 2M-20). 
Among differentially expressed genes in lfnb~'~ neurons, 31 
(p = 6.937e-4) were involved in regulating dopamine (DA) 
signaling (Figure 2P), which is involved in coordinating move- 
ments. DA signaling was also defective in the STR. No major dif- 
ferences were found in levels of DA or its metabolite homovanillic 
acid (HVA) (data not shown) in 3- to 6-month-old lfnb~'~ mice, but 
the DA metabolite dihydroxyphenylacetic acid (DOPAC) was 
significantly lower, thus significantly affecting DA/DOPAC and 
DOPAC/FIVA ratios (Figure 2Q). These findings support the 
importance of IFN-p in regulating dopamine turnover and pro- 
tecting dopaminergic neurons. 

Lack of Neuronal IFN-p-IFNAR Signaling Causes Lewy 
Bodies 

Since lfnb~'~ mice defects overlapped with neurodegenerative 
diseases, particularly PD, we examined ifnb~'~ brain pathology. 
Gross anatomy was unchanged, but histological examination 
showed that lfnb~'~ neuron degeneration was associated with 
age-dependent a-synucleinopathy. Staining for a-syn was 
normal in 1 .5-month-old lfnb~'~ brains; by 3 months, a-syn was 
found in LB-like structures in SN; however, a-syn staining inten- 
sity was reduced in 12-month-old lfnb~^~ mice (Figures S3A and 
S3B), likely reflecting degeneration of TH"^ neuron (Figure 2L). 
Alpha-syn and large pathogenic aggregates of phosphorylated 
(pSerl 29) a-syn were found in TH^ DA neurons of SN (Figure 3A). 
At 3 months, a-syn aggregates were widespread in the STR, 
frontal cortex (Figure 3B), hippocampus, and cerebellum (Fig- 
ures S3C and S3D). Alpha-syn'" aggregates and neurites were 
found sporadically in thalamus, the brainstem, and subthalamic 
regions of 3-month-old lfnb~'~ mice (Figures S3E-S3G). Neurons 
with a-syn"" LB-like structures increased with age (6- and 



12-month-old) in lfnb~'~ mouse thalamus (Figures 3C and 3D). 
Whole-brain protein extracts from 1.5-month-old mice were 
not different from WT, but 3-month-old mice had significantly 
increased a-syn in insoluble fractions and no difference in solu- 
ble fractions (Figures 3E-3I). No difference in mRNA for a-syn 
was observed between lfnb~^~ and WT brains (Figure 3J); thus, 
the a-syn accumulation was not due to increased transcription. 
The insoluble a-syn fraction in the BG (including SN) was signif- 
icantly higher in older mice, but the soluble fraction was still 
unchanged. A significant increase in high-molecular-weight 
dimeric, trimeric, and oligomeric a-syn were seen in lfnb~^~ 
mice, but with a decrease in tetramers. While aggregated 
a-syn oligomers are neurotoxic (Rockenstein et al., 2014), 
a-syn tetramers are suggested to be the normal aggregation 
resistant conformation of the protein (Bartels et al., 2011; 
Wang et al., 2011), possibly explaining the tetramer abundance 
in mice. pSer129-a-syn, which is prone to form patho- 

genic fibrillar aggregates, was increased in lfnb~^~ mice (Figures 
3K-3M). lfnar~'~ brains showed a similar pattern to lfnb~'~ 
regarding a-syn and pSer129-a-syn accumulation (Figure S3FI). 
Blocking with a-synge-^o peptide confirmed specific immunore- 
activity for a-syn in lfnb~^~ brains. To ensure that lack of Ifnb did 
not generate crossreacting a-syn-independent aggregates, we 
generated Scna~'~lfnb~^~ double-KO mice (DKO). Ifnb~^~ but not 
Scna~^~ or DKO mice had a-syn"" aggregates or LBs (Figures 
S3I-S3K). 

Accumulated polyubiquitinated proteins are associated with 
neurodegenerative disease (Davies et al., 1997; Hara et al., 
2006; Komatsu et al., 2006). While 1.5-month-old WT and 
lfnb~^~ mice showed not differences, 3-month-old lfnb~^~ brains 
had increased polyubiquitin (Figures 3N-3R), in hippocampus 
and cerebellum (Figures S3L and S3M), SN and locus coeruleus 
(Figures 30 and 3P) and vestibular nuclei of pons (not 
shown). Ubiquitin accumulated with inclusion body abundance, 
as seen in the locus coeruleus of 12-month-old lfnb~^~ mice 
(Figure 3Q) and a-syn"" aggregates colocalizing with ubiquitin 
and phosphorylated Tau (pTau) significantly increased (Figures 
3S and 3T). 

To exclude systemic immune response influences, we 
analyzed lfnar~'~ and nes^’'^:lfnai^'^^' mice. Consistent with 
lfnb~'~ mice, a-syn'" LB-like structures were found in the 



Figure 2. Gene Profiling of \fnb~'~ Neurons Shows Association with Neurodegenerative Diseases 

Mouse Affymetrix 430 2.0 arrays. 

(A-F) Venn diagrams for gene expression anaiysis oilfnb~^~, CGNs with or without riFN-p, n = 3. Genes differentiaiiy reguiated by 1 .5-foid, *p < 0.05. (A, C, 
and E) Upreguiated. (B, D, and F) downreguiated genes. 

(G and H) Gene expression anaiyzed for (G) top ten generai signaiing pathways and (H) nervous system deveiopment and function pathways. 

(i) Neurogenesis in hippocampus dentate gyrus showing iF of an Ifnb^^^ mouse and quantifying aii groups: n = 4 mice/group and six to 12 sagittai sections were 
counted for BrdU'^DCX'^ neurons. 

(J) GSEA Heatmaps of core-enriched pathways; HD, PD, AD, and prion disease pathways were the top four neuroiogic disease pathways without or with 
riFN-p, n = 3. 

(K and L) IHC TH staining of coronai (K) STR and (L) SN sections. Data are (K) mean opticai density (OD) ± SEM and (L) TH^ ceil bodies per SN area ± SEM, 
n = 4 mice. 

(M and N) Stitched IF images showing NeuN (green) and TH (red) (M) immunoreactivity and (N) quantifications of n = 4 mice/group in three to four ventral midbrain 
regions. 

(O) WB of BG from 8-month-old mice (n = 4). Graph, mean integral optical density (lOD) ± SEM of TH bands. 

(P) Overview of the DA system. Red, upreguiated; blue, downreguiated genes from comparative microarray analysis of Ifnb^^^ and lfnb~'~ CGNs. 

(Q) High-performance liquid chromatography (HPLC) analysis of DA, DOPAC, and HVA in STR of 3- to 6-month-old mice, n = 9-10/group. 

For (I), (K), (L), (N), and (Q), *p < 0.05, **p < 0.01 , ***p < 0.001 , by Student’s t test. See also Figure S2 and Data SI . 
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cerebellum, granular and molecular layers (Figure S3N), and 
significantly in thalamus of lfnar~'~ and nes^’'^:lfnai^'^^' mice (Fig- 
ure 3U). The ultrastructure of lfnb~^~ and lfnar~'~ neurons in 
paratenial and central medial thalamic nuclei contained a-syn- 
immunoreactive perinuclear LB-like structures (Figures 3V and 
3W), which were absent in mice (data not shown), under- 

scoring the importance of endogenous IFN-p-IFNAR signaling in 
preventing neuronal proteinopathy. 

Lack of IFN-3 Affects Autophagy 

We used GSEA to identify cellular pathways involved in lfnb~^~ 
neuron pathology. In the top 20 deregulated pathways, three 
were associated with autophagy, which were restored with 
rIFN-p (Figure 4A; Table S2). The autophagy system uses 
adaptor proteins SQSTM1/p62 (hereafter p62) and NBR1 to 
bind ubiquitinated proteins and LC3B-II in the autophagosomal 
isolation membrane. Upon autophagy, cytosolic LC3B-I is con- 
verted to membrane bound LC3B-II and blocking autophagy 
flux, e.g., with Rab7 defects (Gutierrez et al., 2004; Hyttinen 
et al., 2013), causes accumulation of autophagy-targeted pro- 
teins; p62, NBR1, LC3B-II, and organelles. Increased neurons 
with accumulated p62 were seen in 3-month-old lfnb~'~ brains 
compared to WT, predominantly in brainstem, without increased 
p62 mRNA. LC3B-II increased in BG of 1.5-month-old ifnb~'~, 
correlating with increased p62, NBR1 and Rab7, supporting 
defects in autophagy before a-syn, ubiquitin, pTau, and LB 
aggregation. Autophagy was even more deregulated in 6- to 
8-month lfnb~'~ brains (Figures 4B-4H). Defective autophagy 
was also confirmed in lfnar~'~ mice (Figure S4). 

Senescent or damaged mitochondria are degraded by mi- 
tophagy. Large, cytoplasmic, electron-dense aggregates asso- 
ciated with lipid droplets were exclusively found in lfnb~^~ and If- 
nar~'~ thalamic neurons, which correlated with significantly more 
mitochondria than in WT (Figures 4I-4M). Cultured lfnb~'~ neu- 
rons had significantly lower mitochondrial membrane potential 
(MMP) than Ifnb'^^'^ neurons, indicating senescent or damaged 



mitochondria. MMP was not affected by rIFN-p contrary to pos- 
itive and negative regulators (Figure 4N). Thus, IFN-p signaling 
was important in regulating autophagy flux including neuronal 
mitophagy. 

Late-Stage Autophagy Block Causes a-syn 
Accumulation 

lfnb~'~ CNs and CGNs and lfnar~'~ CNs showed autophagy de- 
fects resembling a late-stage autophagy block (Figures 5A, 5B, 
S5A, and S5B). In lfnb~'~ CNs, LC3B-I and LC3B-II were 
increased with p62, NBR1, and K-63-linked ubiquitin, while 
K-48-linked ubiquitin only increased slightly (Figures 5A and 
5B). Both Ifnb^^^ and lfnar~'~ produced detectable IFN-p, unlike 
lfnb~^~ neurons (Figure S5C). To validate a block in autophagy 
flux, fusion between autophagosomes and lysosomes was in- 
hibited with NH4CI. At 1 hr, LC3B-II levels saturated in both ge- 
notypes, which correlated with a significant increase in p62 in 
Ifnb'^^'^ CNs; however, lfnb~^~ CNs had nearly saturated p62 
levels without treatment, and no significant increase was seen 
with NH4CI (Figures 5C and 5D). Autophagy flux was measured 
in CNs with the mRFP-GFP-LC3B construct, which emits 
mRFP and GFP signal in autophagosomes and only mRFP in 
autolysosomes because of low pH in the latter. Ifnb~^~ CNs had 
increased mCherry'^GFP^ autophagosomes and very few autoly- 
sosomes compared to WT again suggesting blocked autophagy 
flux. Treating Ifnb'^^'^ CNs with NH4CI caused a similar distribu- 
tion as lfnb~^~ CNs. Rab7 significantly increased in lfnb~^~ CNs, 
suggesting accumulation of mature autophagosomes. In sup- 
port, fewer LC3B- and p62-positive autophagic vacuoles over- 
lapped with LAMP1, a lysosomal marker, in lfnb~^~ neurons 
indicating flux or fusion problems (Figures 5E-5H). GSEA sug- 
gested dysregulation in lysosomal genes in lfnb~^~ neurons 
(Table S2). However, numbers and dysfunction of lysosomes 
measured as acidification, morphology, or cathepsin expression 
and activity did not differ in Ifnb'^^'^ and lfnb~^~ CNs and brains 
(Figures S5D-S5I). 



Figure 3. \fnb~'~ Mice Develop LBs with a-syn, Ubiquitin, and pTau 

(A) IF of a-syn in SN of 6-month-old mice. Scale bar, 10 |im; 2 |im in close-up lower panel. 

(B) IF of 3-month-old brains showing a-syn"^ neuron in frontal cortex and positive aggregates in SIR. Scale bars, 10 |am. 

(C) IHC of a-syn in thalamus. Scale bar, 20 i^m. 

(D) Number of a-syn-positive LBs in thalamus of 6- and 1 2-month-old mice. ***p < 0.001 by Mann-Whitney U test for two groups, ^^^p < 0.001 by Kruskal-Wallis 
test for all groups. 

(E-G) (E) WB of a-syn and quantified lOD in (F) soluble and (G) insoluble BG fractions of 1 .5-month-old mice. Data are mean ± SEM, n = 4. 

(H and I) (H) WB of a-syn and (I) quantified lOD in TX-100 soluble and insoluble fractions of 3-month-old brains. Data are mean ± SEM, n = 5. 

(J) RT-PCR of a-syn in lfnb~^~ relative to Ifnb'^^'^ brains. Data are mean ± SEM, n = 3-5. 

(K) WB of a-syn (TX-100 soluble and insoluble) from BG of 8-month-old mice and long exposure of whole membranes of soluble fractions. 

(L and M) Quantified lOD of a-syn and pSer129-a-syn in (L) insoluble and (M) soluble fractions of monomeric (short exposure) and high-molecular-weight a-syn 
species (long exposure). Graphs, mean ± SEM, n = 5. 

(N) WB and quantified ubiquitin lOD in BG from 1.5-month-old mice. Data are mean ± SEM, n = 3. 

(O) IHC of ubiquitin in SN at 3 months and quantified ubiquitin^ aggregates/|am^. Scale bar, 200 ^im; 20 |am in inserts. Graph, mean ± SEM, n = 5. 

(P and Q) IHC of ubiquitin in locus coeruleus at (P) 3 months and (Q) 12 months. Scale bar, 200 ^im. 

(R) WB and quantified ubiquitin lOD in brain extracts from 3-month-old mice. Data are mean ± SEM, n = 3. 

(S and T) IF and quantification of (S) ubiquitin^a-syn^ and (T) p-tau^a-syn"^ aggregates in thalami of 12-month-old mice. Scale bar, 10 ^im; data are percentage 

double positives ± SEM. 

(U) IHC of a-syn in mice thalami showing LB-like structures (arrows). Scale bar, 50 i^m. Graph, a-syn^ LB-like structures in thalamus ± SEM, n = 5. ***p < 0.001 by 
one-way ANOVA and Turkey’s post hoc correction test. 

(V and W) Immuno-EM of thalamic neurons from lfnb~'~ and lfnar~'~ mice showing LB-like structures positive for a-syn by (V) immuno-DAB reactivity and (W) 
immunogold labeling. 

For (I), (L), (M), (O), (R), (S), and (T), *p < 0.05, **p < 0.01 , ***p < 0.001 by Student’s t test. See also Figure S3. 
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lfnb~'~ CNs developed ubiquitin"^ a-syn aggregates and 
increased monomeric and high-molecular-weight a-syn after 
21 days of culture (Figures 6A and 6B). Accumulation of a-syn 
was not due to proteasomal defects; turnover of the proteasome 
substrate p53 after cycloheximide treatment and proteasomal 
catalytic activity were uncompromised in lfnb~^~ CNs and un- 
changed by rIFN-p (Figure S6). 

Blocking autophagy flux increased LC3B-II and a-syn, and 
a-syn colocalized with LC3B^ and p62^ autophagosomes in 
lfnb~'~ CNs (Figures 6C and 6D), underscoring the relevance for 
autophagic clearance of a-syn. Aggregation-prone pSer129-a- 
syn was seen in LC3B"' autophagosomes, that rarely overlapped 
with l_AMP1. Collectively, these results demonstrated that 
lack of IFN-p reduced lysosomal fusion and caused a-syn 
accumulation. 

IFN-p Promotes Neuronal Autophagy and a-syn 
Clearance 

LC3B-II and p62 were higher in untreated lfnb~'~ CNs, and, while 
overnight rIFN-p-treatment promoted LC3B-II conversion and 
reduced p62 in Ifnb^^^ CNs, indicating increased autophagy 
flux, rIFN-p reduced p62 but only slightly increased LC3B-II in 
lfnb~^~ CNs. By promoting autophagy, rIFN-p reduced a-syn in 
both Ifnb^^^ and !fnb~'~ CNs (Figures 7A-7D). Effects of rapamy- 
cin, an mTOR-dependent autophagy activator, were similar to 
rIFN-p, but the mTOR-independent inducer trehalose more effi- 
ciently reduced a-syn in CNs from both genotypes, possibly 
through increased a-syn secretion (Ejlerskov et al., 2013). To 
ensure reconstitution of the genetic defect, we cultured lfnb~^~ 
neurons with a low rIFN-p dose for 21 days, which increased 
autophagy flux and a-syn clearance to the level in Ifnb^'^ neurons 
(Figures 7E and 7F). 

Ifnb Gene Therapy Prevents Dopaminergic Neuron Loss 
in a Familial PD Model 

We used lentiviruses to overexpress IFN-p to examine effects on 
a familial PD model induced with human a-syn (hSCNA) in rat 
SNs. Injection of hSCNA and control lentiviruses blocked auto- 
phagy indicated by accumulated LC3B-II, p62, NBR1, Beclini, 
and hSCNA in BG 10 days after SN injection (Decressac et al., 
2013) (Figures 7G and 7H). Ifnb overexpression prevented 
hSCNA and pSer129-a-syn accumulation and restored TFI 
loss. The mice showed improved left paw use compared to right 
paw use; contralateral to injection side of hSCNA/Z/hb and 



hSCNA/control viruses, respectively, 21 days post injection, 
which was associated with preservation of TH'^ fibers in SN (Fig- 
ures 71 and 7J). Ifnb gene therapy also significantly protected 
TH^ dopaminergic neurons from hSCNA-induced SN damage 
(Figure 7K). Thus, IFN-p prevented pathology in a familial model 
of PD by inducing autophagy and a-syn clearance. 

DISCUSSION 

CNS immune activation and inflammation occur in neurodegen- 
erative diseases (Brochard et al., 2009; Lee et al., 2009; Maccioni 
et al., 2009; Tansey et al., 2008), but their role in initiation is un- 
clear. We report that defects in IFN-p-IFNAR signaling, that is 
central to immune regulation, trigger neurodegeneration in the 
CNS of aging mice. IFN-p promotes neurite growth and protein 
degradation by autophagy. Lack of IFN-p causes neural patho- 
logical changes: accumulation of LB-like structures, neural 
apoptosis, and neurogenesis defects. p62 accumulated in 
!fnb~'~ brainstems and BG regions before pathological a-syn 
aggregation. Aggregates and LB-like structures with a-syn 
were seen in brainstem and BG including SN, cortex, thalamus, 
and cerebellum as previously reported in PD and DLB patients 
(Goedert et al., 2013; Mori et al., 2003). 

IFN-p is expressed by neurons to prevent malignant growth 
(Liu et al., 2013) and neuroinflammation (Liu et al., 2014) and is 
produced by choroid plexus epithelial cells in aged mouse and 
human brains without CNS disease (Baruch et al., 2014). In the 
latter study, injection of an anti-IFNAR antibody in the cerebro- 
spinal fluid, however, positively affected some aspects of cogni- 
tion. This approach could have complications as injection of 
full-length antibody might initiate complement-dependent cyto- 
toxicity (Nelson, 2010) and inflammation (Congdon et al., 2013; 
Linnartz et al., 2012) and cause rapid antibody clearance 
compared to isotype controls (Sheehan et al., 2006). Genetic 
generation of Ifnb or Ifnar knockout mice or specifically targeting 
neuroectodermal cells in nes^’'^:lfnai^'^^' mice circumvents such 
issues. We found that neuronal IFN-p-IFNAR signaling is 
required for neurons to withstand age-associated pathology. 
The data also suggest that neuronal IFN-p production in the 
CNS parenchyma might function differently than high produc- 
tion by choroid plexus epithelial cells or potentially resident 
microglia. 

Familial but not sporadic neurodegenerative diseases are 
associated with overexpressed or mutant proteins such as 



Figure 4. Late-Stage Degradative Autophagosomes Accumulate in \fnb~'~ Neurons 

(A) Heatmaps of GSEA core-enriched genes in autophagy-related pathways comparing and lfnb~'~ mouse CGNs, lfnb~^~ with or without 24 hr rlFN-(3 

(100 U/ml), n = 3. 

(B) IHC from pons area using p62 antibodies. Scale bar, 20 i^m. 

(C) Quantification of p62'^ cells/15 mm^. Data are mean ± SEM, n = 5. 

(D) Fold change in p62 mRNA in brain extracts of 3-month-old mice. Data are mean ± SEM, n = 3. 

(E-H) WB of BG of (E) 1 .5-month-old and (G) 6- to 8-month-old mice and (F) and (H) quantified lOD of bands. Data are mean ± SEM of four or five brains. 

(I-L) TEM of 9-month-old mice thalami. (I-J) lfnb~^~ mice showing perinuclear electron-dense late-stage autophagic vacuoles (AV), most surrounded by single lipid 
membrane and associated with lipid droplets. rER, rough endoplasmic reticulum. (K) lfnar~^~ mouse with similar electron-dense aggregates. (L) TEM of mito- 
chondria in thalamic neuron cell body. M, mitochondria. 

(M) Graph, mean ± SEM mitochondria per cell body of 1 2-1 6 thalamic neurons; ***p < 0.001 , ****p < 0.0001 by one-way ANOVA with post hoc Dunnett’s multiple 
comparisons test. 

(N) TMRE analysis of 21 -day-old CNs with mean TMRE area per cell, n = 3. *Treatment effect, and ^genotype effect. 

For (B), (C), (F), (H), and (N), V^p < 0.05, **p < 0.01 , < 0.001 by unpaired Student’s t test. See also Figure S4 and Table S2. 
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Figure 5. \fnb~'~ Neurons Have a Defect in Autophagy Maturation 

(A-H) Ifnb'^^^ and lfnb~^~ primary CNs cultured for 21 days. 

(A) WB from three independent experiments with antibodies as indicated. 

(legend continued on next page) 
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a-syn in PD (Thenganatt and Jankovic, 2014). Overexpression of 
disease-associated proteins (Arima et al., 1999; Jellinger, 2000; 
Lippa et al., 1998) was not required to accumulate LB-like 
structures in lfnb~'~, lfnar~'~, or nes^’'^:lfnai^'^^' neurons. Ifnb or 
Ifnar deletion was the sole trigger for a-syn-containing inclusion 
bodies resulting from defective autophagosome maturation. 
Treatment with rIFN-p promoted autophagy flux and increased 
a-syn clearance in Ifnb'^^'^ and lfnb~^~ neurons, supporting 
IFN-p function in mutated proteins clearance, as suggested 
with ataxin 7 (Chort et al., 2013). Shared pathology of lfnb~^~ 
and nes^''^:lfnai^'^^' mice suggests that the pathogenesis is 
likely driven by initial changes in neuroectodermal neurons 
rather than systemic or local immune activation. In agreement, 
neurons are crucial for regulation of CNS inflammation (Liu 
et al., 2006). 

Protein degradation defects are common in neurodegenera- 
tive pathologies (Davies et al., 1997; Flara et al., 2006; Komatsu 
et al., 2006). We found that endogenous neuronal IFN-p signaling 
is central in regulating protein degradation by autophagy, 
including clearing aged mitochondria. Accumulated aged and 
defective mitochondria may release reactive oxygen species, 
enhancing neuroinflammation and neuronal death (Chaturvedi 
and Flint Beal, 2013). Cognitive and motoric impairments in PD 
is associated with DA dysfunction (Narayanan et al., 2013). 
Excess cytosolic DA is degraded by monoamine oxidase 
(MAO) in the mitochondrial outer membrane and MAO defects 
might cause oxidative stress (Segura-Aguilar et al., 2014). 
Senescent and damaged mitochondria and reduced MAO 
mRNA were found in lfnb~^~ mice and reduced DOPAC in 
lfnb~^~ mice supported dysregulation of MAO and mitochondrial 
genes potentially contributing to neurotoxicity and DA neuron 
death. 

Lack of endogenous IFN-p signaling was associated with 
spontaneous neurodegeneration, impaired motor coordination 
and cognition, and neuronal LB-like inclusions with aging as 
seen in most PD and DLB patients (Jellinger, 2008). Gene 
profiling supported essential IFN-p regulation of neuronal ho- 
meostasis. Neurotrophin was a top hit in GSEA and increased 
Tox mRNA, important for corticogenesis (Artegiani et al., 2015), 
supported IFN-p effects on prosurvival mechanisms. 

We showed that Ifnb gene therapy reversed pathology in a 
familial PD model, by promoting autophagy and a-syn clearance, 
which preserved DA neurons and associated neurologic deficit. 
Our data strongly support an essential role for IFN-p signaling in 
preventing neurodegenerative pathology and suggest lfnb~'~ 
mice as a model for nonfamilial, sporadic neurodegenerative dis- 



eases, particularly PD and DLB, with potential for testing future 
therapies. 

EXPERIMENTAL PROCEDURES 

For detailed procedures, see Supplemental Experimental Procedures. 

Mice and Cell Culture 

lfnb~^~ mice (Erlandsson et al., 1998) were backcrossed 20 generations to 
BIO. Rill or C57BL6. Ifnar~^~ and nes^’'^:lfnai^'^^' mice were in C57BL6 (Prinz 
et al., 2008). WT were lfnb^'~, Ifnb^^^ littermates, or Ifnar^'^ C57BL6 mice. 
C57BL/6JOIaHsd mice (Harlan Laboratories) with a spontaneous deletion of 
part of Snca (a-syn) were crossed with lfnb~'~ mice for Scna~'~!fnb~'~ . Mice 
were housed in standard facilities. Sex- and weight-matched mice were 
used in experiments performed in accordance with the ethical committees in 
Denmark and approved by our institutional review boards. CGNs were from 
6- or 7-day-old cerebella and cortical neuron (CN) cultures from the cortex 
of 1 -day-old mice. 

Behavioral Measurements 

Motor-coordination and -learning were evaluated with an accelerating 
RotaRod (TSE Systems GmbH) automatically recording time before fall. Neuro- 
muscular strength was tested by forelimb hanging time on a bar. Heat and cold 
tail-pain sensitivity was measured by tail-flick latency time after exposure. 

Spatial learning and reference memory were assessed with Morris water 
maze (Vorhees and Williams, 2006) with slight modifications. Swimming pat- 
terns were recorded with Ethovision 3.1 (Noldus Information Technology), 
measuring the time to reach a hidden platform during learning trials and fre- 
quency of platform position crossings during probe tests. Deficits prevalence 
were calculated as behavioral test scores of !fnb~'~ mice that deviated from the 
norm: mean value (SD/2) of the Ifnb^'^ group (Table SI). 

In the cylinder tests, asymmetry in forelimb use during vertical exploration 
was used as a validated measure of akinesia in hemiparkinsonian rodents. 

Cloning 

Mouse Ifnb pCR4IFNb was from transOMIC (accession no. BC1 19395). Ifnb 
was transferred to pCSIl-GW via pCR8TOPOGW (Invitrogen) with conven- 
tional cloning techniques to generate pCSIl-IFNb. PCSIl (without insert) was 
generated by lambda recombination with an empty pCR8GW vector. Plasmid 
inserts were verified by sequencing. 

Surgery with AAV6-hSCNA 

AAV-GFP or AAV-human a-syn/hSNCA-WPRE (Vector Biolabs) together with 
lentiviral vectors pCSIl-IFNb and pCSIl control (3 |al for each virus) were in- 
jected unilaterally into 30 adult female Sprague Dawley rats (Taconic; 225- 
250 g at surgery). Virus was infused at 0.2 i^l/min as described (Decressac 
etal.,2013). 

Immunohistochemistry, Immunofluorescence, and Transmission 
Eiectron Microscopy 

For immunohistochemistry (IHC) and immunofluorescence (IF), either mice 
were perfused and brains fixed in 4% paraformaldehyde (PFA) and paraffin 



(B) Quantified lOD of WB bands normalized to vinculin. Data are mean ± SEM, n = 3. 

(C) WB of CNs with or without NH4CI (20 mM) for 1 or 4 hr. 

(D) Quantified IQD of WB. Data are mean ± SEM of n = 3, V*p < 0.05, **/*^p < 0.OI by one-way ANQVA. *Within and # between genotype differences after NH4CI 
treatment. 

(E) CNs expressing mRFP-GFP-LC3B with or without NH4CI for 2 hr. Arrowheads, colocalized GFP and mRFP (autophagosomes); arrows, mRFP-only vesicles 
(autolysosomes). Scale bars, 10 i^m; 2 i^m in inserts. 

(F) Graphs, mean number of autophagosomes and autolysosomes/cell and vesicle ratio ± SEM, n = 3. 

(G) WB of Rab7. Graph, IQD of WB. Data are mean ± SEM of n = 3. 

(H) IF showing LC3B and p62 colocalized in autophagosomes (arrowheads) and LC3B, LAMP1, and p62 colocalizing in autolysosomes (arrows). Scale bars, 
10 i^m; 2 i^m in inserts. 

For (B), (F), and (G), *p < 0.05, *p < 0.01 , ***p < 0.001 by unpaired Student’s t test. See also Figure S5. 
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Figure 6. Autophagy Block Causes a-syn Accumulation 

(A) IF of lfnb~^~ CN with inclusion body (arrow). Scale bar, 20 |am; 2 ^im in insert. 

(B) WB and quantified lOD bands of monomeric and high-molecular-weight (Mw) a-syn. Data are mean ± SEM, n = 3. *p < 0.05 by unpaired Student’s 
t test. 

(C) WB of Ifnb^^^ CNs treated with 20 mM NH4CI. 

(D) IF of CNs from lfnb~^~ mice. Arrows, triple colocalizing vesicular structures; arrowheads, LC3B and phosphorylated (Seri 29) a-syn-positive vesicular 
structures. Scale bars, 10 ^im; 2 ^im in inserts. See also Figure S6. 



embedded or brains were dissected and snap-frozen before sectioning. 
In vitro neuronal cultures were fixed in 4% PFA before staining. Tissues and 
cells were stained as described (Liu et al., 2014). 

For neurogenesis and BrdU staining, mice were injected intraperitoneally 
(i.p.) once per day (75 ^ig/g body weight) for 5 consecutive days and sacrificed 
2 hr after last injection. Brains were processed and immunostained for BrdU 
and doublecortin (DCX). 

MMP was measured by adding tetramethyirhodamine ethyl ester (TMRE) 
and Hoechst (Life Technologies) to primary CNs. 

IF images were taken with a Zeiss LSM510 confocal scanning microscope 
and IN Cell Analyzer 2200 automated microscope. IHC images were taken 
with a NanoZoomer 2.0-HT digital slide scanner or Olympus BX51 micro- 



scope. Images were quantified with Imaged (Fiji version), IN Cell Investigator, 
CellProfiler, Zeiss Zen, and Adobe Photoshop. 

In situ apoptosis detection was with TUNEL kits (Calbiochem) with Hoechst 
counterstaining or DAB substrate and methyl green counterstaining kits (R&D 
Systems). 

For transmission electron microscopy (TEM), 12-month-old mice were car- 
diac perfused with 2% PFA and 0.2% glutaraldehyde and paratenial, and cen- 
tral medial thalamic nuclei were dissected and processed for epon embedding 
and ultrathin sectioning. Samples were incubated with primary a-syn anti- 
bodies (Leica) and biotinylated (Vector Laboratories) or 1 .4-nm gold-labeled 
secondary antibodies (Nanoprobes) and embedded in epoxy resin. Ultra-thin 
sections were analyzed in a Philips CM100 electron microscope. 
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Transfection and Plasmids 

CNs were transfected with mRFP-GFP-LC3 using Lipofectamine 2000 (Life 
Technoiogies) according to the manufacturer’s description for iive-ceii 
confocai imaging. 

Stereologicai Analysis 

The opticai fractionator method was used to estimate neuron and giiai 
numbers in the hippocampus of 80-|am mouse brain sections stained with 
H&E. Pointcounting techniques, based on the Cavaiieri principie, were used 
to estimate hippocampai voiume on one side of the brain. 

Reai-Time PCR 

Totai RNA was isoiated using a QiAGEN kit (QiAGEN), reverse transcribed into 
cDNA, ampiified, and quantified by SYBR Green (Bio-Rad) detection. Reiative 
mRNA expression was normaiized with giyceraidehyde 3-phosphate dehydro- 
genase {Gapdh) gene. 

High-Performance Liquid Chromatography 

Striatum was dissected and homogenized. Fiitered supernatant was examined 
for DA, DOPAC, and HVA ieveis by reversed-phase HPLCy (Decressac et al., 
2013). 

Western Blots 

Sampies were iysed in 1% Triton X-100 (Sigma) and insoiubie brain peiiets 
were sonicated in UREA/SDS and processed as described (Ejlerskov et ai., 
2013). 

Affymetrix Microarrays 

RNA was extracted with TRi (Sigma) and DNase i (invitrogen) from 3-day-oid 
Ifnb^^^ and lfnb~'~ CGN cuitures in tripiicate with or without 24 hr riFN-p 
(100 U/mi). Affymetrix 430 2.0 microarray chip (SCiBLU, Affymetrix) data 
were anaiyzed with Arraystar 3 (DNA STAR) and quantiie-normaiized and pro- 
cessed by the RMA (Affymetrix) aigorithm. We iog 2 -transformed intensity 
vaiues, and normai-distributed data were tested in unpaired two-taiied 
Student’s t tests, fiitering for differentiai reguiation confidence of 95% 
(p < 0.05). Venn diagrams were created with oneChanneiGUi (Bioconductor). 
Quantiie-normaiized RMA-treated data seiected using a 1 .4-foid cutoff were 
anaiyzed with Ingenuity Pathway Analysis software. 

GSEA data heatmaps were generated by extracting lists of core enriched 
genes from GSEA pathway analysis in R using Heatmap2 (Data SI). 

Comparisons of //hb“^“CGN gene profiles and published PD and HD models 
using Affymetrix data (GSE4758, GSE9038) were quantiie-normaiized together 
and summarized in R using RMA algorithms from the Affy-package. Differential 
expression was determined individually within each experiment for our data 
and published PD and HD models by comparing control/WT samples with 
transgenic samples using standard ANOVA. 



Statistical Analysis 

Data were analyzed with unpaired and paired two-tailed Student’s t tests, 
ANOVA, and Mann-Whitney U, and Kruskal-Wallis tests, p < 0.05 was signifi- 
cant. Error bars are SEM. 

ACCESSION NUMBERS 

The accession number for the raw microarray data reported in this paper is 
Gene Expression Omnibus: GSE63815. 

SUPPLEMENTAL INFORMATION 

Supplemental Information includes Supplemental Experimental Procedures, 
six figures, two tables, and a data file and can be found with this article online 
at http://dx.d 0 i. 0 rg/l 0. 1 01 6/j.cell.201 5.08.069. 
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SUMMARY 

Stem cells reside in specialized microenvironments 
known as niches. During Drosophila development, 
glial cells provide a niche that sustains the prolif- 
eration of neural stem cells (neuroblasts) during star- 
vation. We now find that the glial cell niche also 
preserves neuroblast proliferation under conditions 
of hypoxia and oxidative stress. Lipid droplets that 
form in niche glia during oxidative stress limit the 
levels of reactive oxygen species (ROS) and inhibit 
the oxidation of polyunsaturated fatty acids (PUFAs). 
These droplets protect glia and also neuroblasts from 
peroxidation chain reactions that can damage many 
types of macromolecules. The underlying antioxidant 
mechanism involves diverting PUFAs, including diet- 
derived linoleic acid, away from membranes to the 
core of lipid droplets, where they are less vulnerable 
to peroxidation. This study reveals an antioxidant 
role for lipid droplets that could be relevant in many 
different biological contexts. 

INTRODUCTION 

Stem and progenitor cells drive growth during development, tis- 
sue regeneration, and tumorigenesis (Blanpain and Fuchs, 2014; 
Hanahan and Weinberg, 201 1 ; Slack, 2008). They are regulated 
by growth factors and other signals from a local microenviron- 
ment called the niche (Scadden, 2006). In many different physi- 
ological contexts, tissues containing stem cells are exposed to 
low levels of dietary nutrients or oxygen (Barker, 1995; Dunwoo- 
die, 2009; Pugh and Ratcliffe, 2003). Multiple pathways sense 
these environmental stresses and trigger responses that can 
significantly impact upon metabolism (Tower, 2012). Stem, pro- 
genitor, and tumor cells tend to utilize proportionately more 
glycolysis and less oxidative phosphorylation than differentiated 
cells (Burgess et al., 2014). The low oxygen consumption of 
glycolytic metabolism appears well matched to the physiological 
hypoxia of the niche in which many different stem cells reside 
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(Burgess et al., 201 4; Mohyeldin et al., 201 0). This hypoxic micro- 
environment can itself play a key role in regulating the balance 
between stem cell quiescence, self-renewal, and differentiation. 
Hypoxic stem cells and their niche are often associated with hyp- 
oxia inducible factor (HIF) activity and also with an increase in 
reactive oxygen species (ROS), both of which can act as signals 
promoting glycolysis and metabolic reprograming (Lee and 
Simon, 2012; Ushio-Fukai and Rehman, 2014). If hypoxia or 
other forms of oxidative stress induce ROS levels that are high 
enough to exceed cellular defense mechanisms, then they pro- 
mote harmful oxidation and peroxidation chain reactions that 
can damage lipids, proteins, and nucleic acids (Negre-Salvayre 
et al., 2008). Many types of stem cells therefore synthesize 
high levels of antioxidants such as glutathione (GSH) and also 
antioxidant enzymes such as superoxide dismutase (SOD) and 
catalase (Cat) in order to defend themselves against ROS 
(Wang et al., 2013). 

Neural stem cells are critical for growth of the mammalian CNS 
during development and also for neuronal turnover in the sub- 
ventricular zone and dentate gyrus during adulthood (Okano 
and Temple, 2009; Taverna et al., 2014). In common with other 
stem cells, they are known to reside in a niche that is hypoxic 
even when the external environment is normoxic and nutrient 
rich (Cunningham et al., 2012). It is also well described that 
neonatal brain size is highly protected or spared from the 
increased hypoxia and malnutrition that are experienced during 
intrauterine growth restriction (Barker, 1995; Gruenwald, 1963). 
How then do these environmental stresses alter the properties 
of neural stem cells and/or their niche and which, if any, of these 
changes are adaptive for brain sparing? 

The developing Drosophila CNS is a useful model for investi- 
gating the effects of environmental stresses upon neural stem 
and progenitor cells. Embryonic and larval neuroepithelia give 
rise to neural stem cells called neuroblasts, which divide asym- 
metrically to generate multiple types of neurons and glia (Homem 
and Knoblich, 2012; Pearson and Doe, 2004; Skeath and Thor, 
2003). As with other growing and dividing Drosophila cells, larval 
neuroblasts rely heavily upon glycolytic metabolism (Grant et al., 
2010; Homem et al., 2014; Tennessen et al., 2011). Properties of 
neuroblasts such as their type, number of divisions, and lineage 
composition differ from region to region within the CNS (Li et al., 
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Figure 1. Neuroblast Proliferation Is Spared during Environmental Hypoxia 

(A) Cartoon of Drosophila larva, showing the developing wing discs and CNS. Most proliferating cells correspond to neuroblasts (NBs) and ganglion mother cells 
(GMCs) within the CNS and to epithelial progenitors within the wing disc. 

(B) Larval development timeline (hours after larval hatching), depicting the three larval instars and the hypoxia (2.5% oxygen) regimen used. 

(C) In vitro EdU assay shows that cell proliferation in neuroblast lineages (CNS) is more resistant to hypoxia than that of epithelial progenitors (wing disc). After 
22 hr of normoxic or hypoxic NR, tissues were dissected and incubated in vitro with EdU for 1 hr. 

(D) In vivo EdU assay indicates that cell proliferation in neuroblast lineages (CNS) is more resistant to hypoxia than that of epithelial progenitors (wing disc). The 
CNS and wing disc are shown at 73 hr (after 3 hr of normoxic NR with EdU) and also at 92 hr (after a further 19 hr of normoxic or hypoxic NR in the presence of 

(legend continued on next page) 
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2013; Sousa-Nunes et al., 2010). In the central brain and ventral 
ganglion, neuroblasts undergo two periods of neurogenesis 
separated by a period of cell-cycle arrest called quiescence. 
Exit from quiescence (reactivation) occurs during early larval 
stages and requires dietary amino acids (Britton and Edgar, 
1998; Truman and Bate, 1988). Amino acids are sensed by the 
target of rapamycin (TOR) pathway in the fat body and activate 
a systemic relay signal that triggers insulin-like peptide (lip) 
expression in glia, an important niche for larval neuroblasts (Chell 
and Brand, 2010; Colombani et al., 2003; Dumstrei et al., 2003; 
Sousa-Nunes et al., 2011; Speder and Brand, 2014). Glial lips 
then activate the insulin-like receptor (InR) in neuroblasts leading 
to TOR and phosphatidylinositol 3-kinase (PI3K) signaling and 
re-entry into the cell cycle (Chell and Brand, 2010; Sousa-Nunes 
et al., 2011). Later during development, the growth of neuroblast 
lineages becomes largely independent of all dietary nutrients, 
providing a model for brain sparing (Cheng et al., 2011). At this 
stage, niche glia express a secreted growth factor, jelly belly 
(Jeb), regardless of whether dietary nutrients are present or 
absent. Jeb then activates its receptor anaplastic lymphoma ki- 
nase (Aik) in neuroblasts, thus promoting constitutive rather than 
nutrient-dependent PI3K signaling and growth (Cheng et al., 
2011). Similar brain sparing specific to older larvae has also 
been identified in the Drosophila visual system, where the late 
asymmetric but not the early symmetric divisions of neural 
progenitors tend to be spared from nutrient restriction (Lanet 
et al., 201 3). Importantly, however, it has not been clear in any re- 
gion of the Drosophila CNS whether neuroblasts are also pro- 
tected against other environmental stresses such as hypoxia. 
We now investigate this issue using imaging mass spectrometry, 
CNS lipidomics, and genetic manipulations specific for the 
glial niche or the neuroblast lineage. This combined approach 
identifies a molecular mechanism by which lipid metabolism in 
a stem cell niche can protect the stem cells themselves from 
oxidative damage. 

RESULTS 

The Proliferation of Neuroblast Lineages Is Spared 
during Hypoxia 

Drosophila larvae develop into undersized yet viable adults when 
starved during the third instar, (Bakker, 1959; Beadle et al., 
1938). We previously used a nutrient restriction (NR) protocol in 
which third instar larvae are transferred from a yeast/glucose/ 
cornmeal to an agarose medium (Cheng et al., 2011). During 
NR, there is no increase in overall body mass but growth and 
cell division are spared in CNS neuroblast lineages and also, to 
a lesser extent, in epithelial progenitors of the wing disc (Cheng 
et al., 201 1 ; Figure 1A). 

We first quantified the effects of hypoxia upon cell proliferation 
using the thymidine analog 5-ethynyl-2'-deoxyuridine (EdU). This 



method measures the proliferation of neuroblast lineages, with 
glia representing less than 6% of total EdU"^ cells (Sousa-Nunes 
et al., 2011; data not shown). Larvae exposed to hypoxia are 
known to crawl away from yeast paste (Wingrove and C’Farrell, 
1 999), and we found that, in a 2.5% oxygen environment, they do 
not ingest significant quantities of food (Figure SI). This behav- 
ioral response may normally serve to prevent anoxia by limiting 
the duration of burrowing bouts in semi-liquid medium, and it 
can be synchronized by applying artificial hypoxic-normoxic cy- 
cles (Movie SI). To assess specifically the effects of low oxygen, 
rather than reduced feeding, we raised larvae on yeast/glucose/ 
cornmeal until mid-third instar (~70 hr after larval hatching, ALH) 
and then transferred them to NR medium either in normoxia or 
hypoxia (Figure IB). EdU incorporation by thoracic neuroblast 
lineages and by wing disc progenitors was measured at the 
end (1 hr in vitro) or throughout (22 hr in vivo) the hypoxic period 
(Figures 1 C and 1 D). For both tissues, the percentage of cell pro- 
liferation spared in hypoxia (relative to normoxia) is smaller with 
the in vivo cumulative than with the in vitro endpoint assay. This 
difference may reflect brief exposure of hypoxic larvae to nor- 
moxia prior to dissection and in vitro EdU incubation. Impor- 
tantly, however, both methodologies clearly demonstrate that 
cell divisions of neuroblasts are more resistant to hypoxia than 
those of progenitors in the developing wing. 

Hypoxia Induces Lipid Droplets in Subperineurial and 
Cortex Glia 

We next investigated the metabolic specializations underlying 
the striking hypoxia resistance of neuroblast lineages. Highly 
active glycolysis in dividing neuroblasts is unlikely to account 
per se for their selective hypoxia resistance as it is also present 
in wing discs (de la Cova et al., 2014; Homem et al., 2014; data 
not shown). However, using neutral lipid stains (oil red O and 
LipidTOX), we were intrigued to observe lipid droplets in the 
developing CNS (Figures 2A and 2B). Lipid droplets are cyto- 
plasmic organelles comprising a core of neutral lipids, such as 
triacylglycerols (TAGs) and cholesteryl esters, surrounded by a 
phospholipid monolayer associated with lipases and other pro- 
teins regulating droplet biogenesis, lipid storage, and release 
(Kuhniein, 2012; Walther and Farese, 2012; Young and Zechner, 
2013). In Drosophila, lipid droplets in adipose tissue (known as 
fat body) express the regulatory protein Lsd-2 (perilipin-2) on 
their surface (Bickel et al., 2009; Kuhniein, 2011), and we find 
that this is also the case for droplets in the CNS (Figure 2C). 
CNS lipid droplets, however, tend to be 1-2 iim diameter, smaller 
than many of those in the fat body. On standard food in ambient 
normoxia, the number of CNS lipid droplets increases during 
larval development (Figure S2A). This is consistent with a previ- 
ous lipidomic analysis showing that the late third instar CNS con- 
tains significant quantities of TAGs (Carvalho et al., 2012). Lipid 
droplets in the CNS increase rather than decrease during NR 



EdU). CNS insets in (D) show high-magnification views of EdU'^ ceiis, iarge neurobiasts (dotted circie) are aiready EdU'^ at 73 hr and contribute to the finai EdU 
voiumes. 

Histograms in (C) and (D) depict the average voiume of EdU^ ceiis incorporated per wing disc or per thoracic CNS. Scaie bars, 50 i^m, inset scaie bars, 1 0 i^m. In 
this and subsequent figures, histograms show the mean, scatterplots show individual data points, and error bars are 1 SD. The key shows that histogram bars are 
colored according to t tests (p < 0.05): significant decrease (red), significant increase (green), or no significant change (gray) from the control (black). See also 
Figure S1. 
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(with normoxia) and so, in contrast to those in fat body, do not 
appear to correspond to a nutrient store that is depleted during 
starvation (Figure 2D; Gutierrez et al., 2007). However, lipid drop- 
lets increase during development with a concomitant increase in 
the distance separating neuroblasts from the oxygen-supplying 
cells of the tracheal network (Figures S2B-S2D). We then tested 
directly whether low oxygen tension can induce CNS lipid drop- 
lets. Strikingly, exposing larvae to hypoxia induces a 3-fold gain 
in lipid droplets, over and above the normoxic developmental 
increase (Figures 2D and 2E). Importantly, this strong induction 
is CNS specific, as lipid droplets in the wing disc (Fauny et al., 
2005; Parra-Peralbo and Culi, 2011) do not increase significantly 
during hypoxia (Figures 2D and 2E). Hence, lipid droplets accu- 
mulate in the CNS during normal development and can also be 
strongly induced by hypoxia. 



Figure 2. Hypoxia Induces Lipid Droplets in 
the CNS 

(A) Oil red 0-stained neutral lipids in the CNS of 
a late-third instar larva (control genotype: i/i/”^®) 
raised on a standard diet. 

(B) LipidTOX stained neutral lipids in the CNS of a 
late-third instar larva raised on a standard diet. 
Neutral lipids (red) are prominent in the central 
brain and ventral nerve cord but do not overlap 
with neuroblast lineages (green: nab-Gal4 > CD8:: 
GFP). Scale bar, 50 i^m. 

(0) Neutral lipids (LipidTOX, red) in the thoracic 
ganglion of the late third instar CNS accumulate in 
clusters of lipid droplets labeled with Lsd-2::YFP 
(green). Scale bars, 1 i^m. 

(D) Lipid droplets (LipidTOX) in the CNS of a mid- 
third instar (70 hr) larva fed on a standard diet 
increase during NR normoxia and become more 
numerous during NR hypoxia. The lipid droplet 
(LD) content of the wing disc does not change 
significantly during hypoxia. It is expressed as 
percentage of volume within the thoracic region of 
interest of the ventral nerve cord (white boxes, 
upper row) or the entire wing disc (white outlines, 
lower row). Scale bars, 50 ^im. 

(E) Lipid droplets increase -^25-fold in normoxia 
and '--'SO-fold in hypoxia in the CNS but do not 
significantly change in the wing disc. Note that fold 
change values in this figure are normalized to the 
70 hr start points of Figure 2D. 

See also Figure S2. 



A panel of GFP and Lsd-2::GFP re- 
porters revealed that lipid droplets are 
primarily found in glial cells but not in 
neuroblasts or in their neuronal progeny 
(Figures 3A and 3B). Lsd-2 localizes to 
the phospholipid monolayer of the lipid 
droplet surface where it promotes neutral 
lipid storage by inhibiting TAG lipolysis 
(Kuhniein, 2011). Specific knockdown in 
glia (repo > Lsd-2 RNAi) prevents the 
accumulation of CNS lipid droplets dur- 
ing normal fed development, thus con- 
firming their glial localization (Figures 3C 
and 3D). Reporters specific for glial subtypes showed that the 
majority of lipid droplet-containing cells correspond to the sub- 
perineurial glia of the blood-brain barrier and the cortex glia 
that enwrap neuroblast lineages (Figures 3A and 3E; Kis et al., 
2015). We conclude that CNS lipid droplets mostly localize to 
the niche (glia) but not to the neural stem cells (neuroblasts). 

Glial Lipid Droplets Sustain Neuroblast Proliferation 
during Hypoxia 

We next determined the metabolic origin of glial lipid droplets 
using multi-isotope imaging mass spectrometry (MIMS, Stein- 
hauser et al., 2012). Larvae were raised until mid-third instar on 
diets containing ^^C-labeled glucose or acetate, major carbon 
sources for lipogenesis (Figure 4A). Glial lipid droplets were 
then induced with 22 hr hypoxia. MIMS analysis revealed strong 
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Figure 3. Lipid Droplets Accumulate in Cortex and Subperineurial 
Glia 

(A) Transverse sections (dorsal up) of the thoracic CNS from late-third instar 

larvae raised on standard diet. Lipid droplets (LipidTOX) are observed nearby, 
but not within neuroblast lineages (labeled with nab > CD8::GFP). They localize 
to glial cells {repo > CD8::GFP) including the cortex glia ^ 

CD8::GFP) and the subperineurial glia {moody > CD8::GFP). Scale bars, 
10 |im. 

(B) Lipid droplets (LipidTOX) localize to glia {repo > Lsd-2::GFP) but not to 
neuroblast lineages (e/av*^^^^ > Lsd2::GFP). Strong Lsd2::GFP (GFP) signal 
localizes to the surface of glial lipid droplets but low-intensity signal in glia and 
neuroblast lineages may indicate localization to other organelles such as the 
ER (Fauny et al., 2005). Scale bars, 10 ^im. 

(C and D) Glial-specific knockdown of Lsd-2 {repo > Lsd-2 RNAi) inhibits lipid 
droplet accumulation in late-third instar larvae raised on standard diet (control: 
repo > w”^®). Scale bars, 50 ^im. 

(E) Diagram showing the distribution of lipid droplets (red) within cortex and 
subperineurial glia (green) surrounding a neuroblast lineage (blue), comprising 
a single neuroblast, a ganglion mother cell, and multiple neuronal progeny. 

incorporation from labeled dietary glucose or acetate into 
the core of hypoxic lipid droplets (Figures 4B and S3A-S3C). 
Maximal ratios in tissue cross sections were observed 

with lipid droplets of 1-2 iim diameter, consistent with the previ- 
ous size estimates from confocal microscopy (Figures S3D- 
S3G). The MIMS analysis demonstrates that de novo fatty acid 
synthesis contributes neutral lipid cargo to glial lipid droplets. 
To distinguish whether the relevant de novo lipogenesis is within 
the glia themselves or in another tissue, glial-specific RNAi was 
used to knock down six enzymes of TAG biosynthesis (Fig- 
ure 4A). Lipid droplet induction after 22 hr hypoxia was roughly 
halved with the knockdown of acetyl-CoA carboxylase (ACC) 
or a predicted fatty acid synthase (CG3524) (Figure 4C). Fatty 
acid synthesis is therefore required in a cell-autonomous manner 
for maximal induction of glial lipid droplets. Given that induction 
was not completely blocked by knocking down fatty acid syn- 
thetic enzymes, we also tested whether there is an additional 
contribution from dietary fatty acids. Linoleic acid (Cl 8:2) is an 
omega-6 polyunsaturated fatty acid (PUFA) that is derived 
from the diet and cannot be synthesized by Drosophila (Fig- 
ure 4A). It is one of the major fatty acids in our standard 
Drosophila diet and remains largely intact following ingestion. 
Thus, larvae raised on a diet containing ^^C-linoleic acid retain 
~80% of the ^^C label incorporated into CNS fatty acids as 
linoleate (Figure S3H). MIMS analysis of these larvae revealed 
strong ^^C incorporation in hypoxia-induced CNS lipid droplets 
(Figure 4B). Together with the previous results, this indicates 
that dietary uptake of fatty acids (such as linoleate) and fatty 
acid synthesis both contribute to glial lipid droplets. 

Partial decreases in hypoxia-induced lipid droplets were 
observed for glial knockdowns of the Drosophila orthologs 
of genes encoding enzymes converting fatty acids into lyso- 
phosphatidic acids (GPAT, CG3209, one of three glycerol- 
3-phosphate acyltransferases) or lysophosphatidic acids into 
phosphatidic acids (AGPAT, CG4753, one of four 1-acyl-sn- 
glycerol-3-phosphate acyltransferases) (Figure 4A; for enzyme 
predictions, see Wilfling et al., 2013). More strikingly, hyp- 
oxia-induced lipid droplets were reduced 95%-100% with 
knockdown of the enzymes converting phosphatidic acids into 
diacylglycerols (Lipin, 3-sn-phosphatidate phosphohydrolase). 
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Figure 4. Neuroblast Proliferation during Hypoxia Requires Glial TAG Synthesis 

(A) TAG synthetic pathway showing the enzymes (boid, see text for detaiis) knocked down by ceii-type-specific RNAi in this study. Dietary fatty acids that are 
saturated (SFA), monounsaturated (MUFA), or poiyunsaturated (PUFA) can be incorporated into the pathway at muitipie steps, but, for ciarity, singie entry points 
are depicted. 

(B) Muiti-isotope imaging mass spectrometry (MiMS) ofgiiai iipid dropiets induced by hypoxia in iarvae raised on diets containing 1-''^C-giucose, 1-"'^C acetate, or 

^^C-U-iinoieate (PUFA). Hue saturation intensity (HSi) images show that high ratios are seiectiveiy detected in CNS iipid dropiets (arrowhead shows one 

exampie). Scaie bars, 5 |im. 

(C) Quantification of hypoxia-induced iipid dropiets (foid change at endpoint versus repo controi) with giiai-specific gene knockdowns {repo>RNAi) of TAG 

synthetic genes. 

(D and E) Quantification of neurobiast proiiferation (in vivo EdU assays), during normoxia or hypoxia with neurobiast iineage {nab-Gal4) or giiai {repo-Gal4) 
-specific DGAT1 knockdown (> DGAT1 RNAi, controi: > w"^®). Pooied data are represented as mean ± SD. 

See aiso Figure S3. 

diacylglycerols into TAGs (DGAT1 , a diacylglycerol 0-acyltrans- together suggest that TAGs are the predominant neutral lipids 
ferase), or with knockdown of Lsd-2, which promotes TAG stor- stored in glial droplets. They also demonstrate clearly that en- 

age in lipid droplets (Figures 4A and 4C). The RNAi results zymes converting fatty acids into TAGs are required in glia for 
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Figure 5. ROS Induce CNS Lipid Droplets 

(A) Hypoxia significantly increases ROS (measured by DHE oxidation) in many 
cells of the larval CNS including neuroblasts (red arrow marks one of the 
neuroblasts). Scale bars, 10 lam. 

(B) CNS lipid droplets increase (fold change relative to NR endpoint) in 
response to 22 hr of in vivo exposure to the following pro-oxidants: chronic 
hypoxia (2.5% oxygen), H 2 C 2 (0.5% v/v), tBH (100 mM), ethanol (10% v/v), 
intermittent hypoxia (44 cycles of 10 min normoxia: 10 min anoxia) or FeCl 2 
(1 00 mM). Note that 22 hr of osmotic stress (1 .25 M NaCI) does not significantly 
increase lipid droplets. 

(C) Dietary supplementation with the antioxidant N-acetyl cysteine (100 mM) 
inhibits lipid droplet induction by NR or tBH (100 mM). 

(D) Glial-specific overexpression of Catalase {repoCat) or Superoxide 
Dismutase 2 {repo>SOD2) inhibits lipid droplet induction by hypoxia (2.5% 
oxygen) or tBH (100 mM) (control genotype: repo > w”^®). 

See also Figure S4. 



the hypoxic induction of lipid droplets. Importantly, there is a 
strong requirement for DGAT1, one of two enzymes catalyzing 
the final and only dedicated reaction in TAG synthesis, which is 
required for droplet biogenesis from the endoplasmic reticulum 
(Wilfling et al.,2014). 

To test whether the hypoxic induction of glial lipid droplets is 
functionally linked to neuroblast proliferation, we overexpressed 
YFP::Pros to drive dividing neuroblasts into premature differen- 
tiation (Maurange et al., 2008). This failed to inhibit glial lipid 
droplets, indicating that neuroblast proliferation is not required 
for glial lipid droplet induction during hypoxia (Figures S4A and 
S4B). The preceding RNAi results also allowed us to test for 
the reciprocal regulatory relationship, namely, that lipid droplets 
are required for neuroblast proliferation. Control experiments 
indicated no significant effects upon neuroblast proliferation if 
DGAT1 is knocked down in neuroblast lineages during normoxia 
or hypoxia, and also if DGAT 1 is knocked down in glia during nor- 
moxia (Figures 4D and 4E; data not shown). However, glial-spe- 
cific DGAT1 knockdown during hypoxia approximately halved 
neuroblast proliferation (Figure 4E). Together with the previous 
results, this finding demonstrates that the synthesis and storage 
of TAGS, as lipid droplets in niche glia, are required to sustain 
neuroblast divisions during hypoxia but not during normoxia. 

Reactive Oxygen Species Induce Glial Lipid Droplets 

To investigate the mechanism by which hypoxia induces glial 
lipid droplets, we first tested the HIF pathway. However, no sub- 
stantial stabilization of a HIF::GFP fusion protein was observed in 
glia during hypoxia, and sima homozygotes lacking Drosophila 
HIF-1 activity retained the ability to accumulate lipid droplets 
during hypoxia (Figures S4C and S4D). We next considered 
alternative induction mechanisms. Hypoxia can markedly in- 
crease the levels of reactive oxygen species (ROS) generated 
by the mitochondrial electron transport chain, which, in turn, 
can lead to oxidative stress and cellular damage (Bleier and 
□rose, 2013; Chandel et al., 1998; Olsen et al., 2013). Even in 
normoxic controls, the superoxide reactive dye dihydroethidium 
(DHE) detected more ROS in neuroblasts than in many other 
cells of the CNS (Figure S4E). Following hypoxia, ROS were 
moderately increased throughout the CNS, with higher levels 
detectable in neuroblasts (Figure 5A). We therefore tested 
whether intermittent hypoxia and a variety of chemical pro-oxi- 
dants known to increase ROS (hydrogen peroxide, tert-butyl hy- 
droperoxide (tBH), ethanol, or iron (II) chloride) would be able to 
induce lipid droplets. All of these pro-oxidants induced glial lipid 
droplets at least as strongly as chronic hypoxia, whereas os- 
motic stress (1 .25 M NaCI) did not (Figure 5B). The antioxidant 
N-acetyl cysteine (NAG) is a ROS scavenger and it replenishes 
the reduced glutathione pool (Atkuri et al., 2007). When added 
to the substrate, NAC inhibited both the moderate and strong in- 
creases of glial lipid droplets observed during NR and tBH stress, 
respectively (Figure 50). In addition, glial-specific overexpres- 
sion of the antioxidant enzymes catalase (Cat) or superoxide dis- 
mutase 2 (Sod2) inhibited hypoxic or tBH induction of glial lipid 
droplets (Figure 5D). Together, the chemical and genetic manip- 
ulations demonstrate that ROS are an important stimulus for 
inducing glial lipid droplets during normal development and in 
the presence of environmental pro-oxidants. 
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Figure 6. PUFAs Redistribute from Phospholipids to TAGs during Oxidative Stress 

(A) Lipid dropiets foiiowing NR, hypoxia (2.5% oxygen), or tBH (1 00 mM) are more abundant on a diet suppiemented with 20 mM iinoieate (Cl 8:2) than with 20 mM 
stearate (Cl 8:0). 

(B) Neutrai ioss scans indicate that paimitate (Cl 6:0), stearate (Cl 8:0), oieate (Cl 8:1), and iinoieate (Cl 8:2) are the major fatty acids present in CNS TAGs 
foiiowing NR. Dietary 20 mM iinoieate (Cl 8:2) ieads to a reiative enrichment of this PUFA in the TAG pooi. X vaiues are m/z, and Y vaiues are reiative signai 
intensities, scaied to the iargest Cl 6:0 peak. 

(C) Positive ion scans indicate the major PC mass enveiopes (gray boxes) in the CNSs of NR iarvae. Dietary 20 mM iinoieate (Cl 8:2) ieads to a reiative decrease in 
PC 1 6:0/1 6:0 and a reiative enrichment of PC species containing Cl 8:2. X vaiues are m/z and Y vaiues are reiative signai intensities, scaied to the PC 1 6:0/1 8:2 
peak. 

(D) Totai FA composition (by GC-MS) of iarvae raised on 20 mM iinoieate diet. Neither hypoxia (n = 3) nor 1 00 mM tBH (n = 3) iead to major changes, compared to 
NR (n = 3), in the reiative abundance of the major fatty acids (top to bottom: Cl 8:2, Cl 8:1, Cl 8:0, Cl 6:1, Cl 6:0, and Cl 4:0). 

(E) Reiative distribution of major fatty acids (Cl 6:0, Cl 8:1 , and Cl 8:2) in the TAG, PC, and PE poois of CNSs from iarvae on 20 mM iinoieate diet. Hypoxia (n = 3) 
and 100 mM tBH (n = 4) both iead to a strong decrease, compared to NR (n = 3), in the proportion of aii three major fatty acids in PCs and PEs, with a corre- 
spondingiy strong increase in their proportions in TAG. 

(legend continued on next page) 
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PUFAs Redistribute from Phospholipids to TAGs during 
Oxidative Stress 

ROS attack the carbon-carbon double bonds in linoleic acid and 
other PUFAs, inducing lipid peroxidation chain reactions that can 
damage membrane phospholipids as well as other macromole- 
cules (Negre-Salvayre et al., 2008). In the presence of oxidative 
stress (hypoxia or tBH), glial lipid droplets are induced more 
strongly when the diet is supplemented with linoleic (C18:2) 
than with stearic (C18:0) acid (Figure 6A). Mass spectrometry 
of isolated CNSs indicates that dietary supplementation with 
linoleic acid increases the proportion of this PUFA in the TAG, 
phosphatidylcholine (PC), and phosphatidylethanolamine (PE) 
pools (Figures 6B and 6C; data not shown). On the high-PUFA 
diet, neither hypoxia nor tBH detectably alter the total fatty 
acid profile of the CNS (Figure 6D). Both oxidative stresses do, 
however, produce a large decrease in the proportion of CNS 
linoleate in membrane phospholipids (PC and PE), with a corre- 
spondingly large increase in the proportion in TAG (Figure 6E). 
This redistribution from phospholipids to TAG is not selective 
for linoleate as it is also observed with the abundant monounsat- 
urated and saturated fatty acids oleate (C18:1) and palmitate 
(C16:0) (Figure 6E). Together with our previous confocal micro- 
scopy and MIMS analyses, the mass spectrometry strongly sug- 
gests that oxidative stress leads to a bulk redistribution of fatty 
acids (including linoleic acid) from membranes to lipid droplets. 

To identify how fatty acids are redistributed from CNS mem- 
branes to lipid droplets, we developed a simplified in vitro model 
that minimizes any input from fatty acid uptake or synthesis. Ex- 
planted CNSs were cultured in a saline medium lacking all lipids, 
sugars, and amino acids. This stress is sufficient to induce lipid 
droplets in an ACC-independent manner, but they are much 
smaller than those formed in vivo (Figures S5A and S5B). Never- 
theless, droplet accumulation in vitro does require DGAT1 and 
Lsd-2, as it does in vivo (Figure S5B). Moreover, we found 
that the enzymes catalyzing the conversions of phosphatidyl- 
choline (PC) into phosphatidic acid (PA) and PA into diacylgly- 
cerol (phospholipase D (PLD) and Lipin, respectively) are both 
required in glia for lipid droplet induction in vitro and in vivo (Fig- 
ures 4A, S5B and S5C). Together, the in vitro and in vivo genetic 
analyses suggest that transfer of fatty acyl chains from mem- 
brane phospholipids to TAGs, via a PLD/Lipin/DGAT1 pathway, 
contributes to the biogenesis of glial lipid droplets during oxida- 
tive stress. 

Glial Lipid Droplets Protect Dividing Neuroblasts from 
PUFA Peroxidation 

On the high PUFA diet, oxidative stress from tBH or hypoxia 
selectively depletes linoleic acid (relative to monounsaturated 
and saturated fatty acids) from the PC pool of the CNS (Fig- 
ure 6F). On a standard diet in the presence of iron (II), a strong 



pro-oxidant (Olsen et al., 2013; Figure 5B), selective depletion 
of linoleate from PCs was also observed (Figure S5D). Although 
linoleic acid is selectively lost from the PC pool during oxidative 
stress, we failed to detect its selective increase (or decrease) in 
the TAG fraction during iron, tBH, or hypoxic stress (Figures S5E 
and S5F). Hence, the loss of linoleate from PCs may not reflect its 
selective remobilization to lipid droplets but could result from 
its selective susceptibility (over more saturated fatty acids) to 
ROS-induced peroxidative damage. Indeed, tBH stress on a 
high linoleate diet does lead to increased PUFA peroxidation, 
as detected by a ratiometric fluorescent sensor (C11-BODIPY 
581/591, Figure 6G). Strikingly, the ratio of oxidized to non- 
oxidized sensor in the CNS is much higher in cell membranes 
than in lipid droplets. Importantly, the PUFA peroxidation ratio 
of CNS membranes increases even further if lipid droplet accu- 
mulation is abrogated by Lsd-2 knockdown (Figure 7A). Even 
though the knockdown was glial specific and a very strong in- 
crease in PUFA peroxidation was observed in glial membranes, 
a moderate increase in PUFA peroxidation was also detected in 
the membranes of neuroblast lineages. This indicates that glial 
lipid droplets exert a non-cell autonomous and protective effect 
upon neuroblast lineages. Together with the mass spectrometry, 
the fluorescent sensor provides evidence that PUFAs like linoleic 
acid are more susceptible to peroxidation when incorporated 
into membrane phospholipids than when stored as TAGs in the 
core of lipid droplets. 

PUFA peroxidation can also lead to the damage of non-lipid 
macromolecules. For example, peroxidation chain reactions of 
omega-6 PUFAs such as linoleic acid generate 4-hydroxy-2- 
nonenal (4-HNE), an unsaturated aldehyde that forms covalent 
adducts with proteins (Uchida, 2003). Following in vitro incuba- 
tions with exogeneous 4-HNE, such protein adducts can be 
detected by immunocytochemistry in most/all cells of the CNS, 
including neuroblasts (Figure S6A). Consistent with studies in 
other biological contexts (Uchida, 2003), 4-HNE induced ROS 
throughout the CNS but, as seen previously during hypoxia 
and even during normal development, levels were higher in cells 
of a large size characteristic of neuroblasts (Figure S6B). 4-HNE 
also markedly decreased the proliferation of neuroblasts, but this 
was efficiently rescued with the antioxidant N-acetylcysteine 
amide (AD4) (Figures S6C and S6D). These in vitro CNS experi- 
ments reveal that 4-HNE and ROS induce each other and also 
that high levels of one or both are deleterious for neuroblast 
proliferation. 

We next determined in vivo whether glial lipid droplets function 
not only to defend membrane lipids against peroxidation, but 
also to protect proteins from 4-HNE damage. For the control 
genotype, no obvious increase in 4-HNE protein adducts was 
observed with tBH stress on standard or high PUFA diets (Fig- 
ures 7B and 7C). In contrast, for glial Lsd-2 knockdown, tBH 



(F) Positive ion scans of PC 1 8:x/1 8:x from the CNSs of iarvae on a 20 mM iinoieate diet. The ratio of 1 8:2/1 8:2 to 1 8:0/1 8:2 is decreased by hypoxia (n = 3, mean = 
4.03, SD = 0.43, p = 0.008) and tBH (n = 3, m = 4.84, SD = 0.3, p = 0.09) reiative to the NR controi (n = 3, m = 5.7, SD = 0.62). For each treatment group, the mean 
spectrum is normaiized to the 18:0/18:2 peak (m/z 786) of NR, and the maximum/minimum peak heights at m/z 782 (brackets) indicate variation between 
bioiogicai repiicates. 

(G) CNS iipid peroxidation foiiowing 1 00 mM tBH treatment of iarvae raised on a 20 mM iinoieate diet. Co-incubation with the PUFA peroxidation sensor (Cl 1 - 
BODIPY 581/591) and LipidTOX 633 shows that the ratio of oxidized: non-oxidized sensor is iower in CNS iipid dropiets (white arrow) than in ceii membranes. 
Scaie bars, 10 |im. See aiso Figure S5. 
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stress increased 4-HNE protein adducts significantly in neuro- 
blasts, and this increase was further augmented by a high 
PUFA diet (Figures 7B and 7C). Cell-type markers indicated 
that Lsd-2 knockdown increases 4-HNE protein adducts not 
only in neuroblasts, but also in glia and in neurons (Figure 7B). 
Under these in vivo conditions, glial lipid droplet knockdown 
led to increased ROS throughout the CNS, with maximal levels 
in neuroblasts (Figure 7D). These findings demonstrate that glial 
lipid droplets are required to defend the proteins of neuroblasts 
and other neural cells against 4-HNE damage from even a mod- 
erate oxidative challenge. 

Finally, we tested whether a high PUFA diet influences the abil- 
ity of glial lipid droplets to protect neuroblast proliferation. With 
intermittent hypoxia on a high PUFA diet, a slight decrease in neu- 
roblast proliferation was observed, and this was exacerbated with 
glial Lsd-2 knockdown (Figure S6E). The milder oxidative stress of 
tBH on a high PUFA diet (or on a standard diet) did not significantly 
alter proliferation nor detectably increase CNS apoptosis in larvae 
of the control genotype (Figure 7E; data not shown). In larvae with 
glial knockdown of Lsd-2, however, proliferation was decreased 
slightly by tBH and much more strongly by the combination of 
tBH with high dietary PUFA (Figure 7E). This deficit in proliferation 
could be significantly rescued by dietary supplementation with 
the antioxidant AD4. Hence, when the diet is rich in PUFAs, glial 
lipid droplets play a particularly critical role in sustaining the pro- 
liferation of neuroblasts during oxidative stress. 

DISCUSSION 

This study reveals that the proliferation of Drosophila neural stem 
cells is defended against oxidative stress and identifies lipid 
droplets in niche glia as a critical element of the protective mech- 
anism. During oxidative stress, PUFAs and other fatty acids are 
redistributed from membrane phospholipids to lipid droplet 
TAGS. Unlike cell membranes, the lipid droplet core provides a 



protective environment that minimizes PUFA peroxidation chain 
reactions and limits ROS levels. This helps to safeguard not only 
the glial cells of the niche but also the neighboring neural stem 
cells and their neuronal progeny. We now compare the protec- 
tive roles of the niche glia during oxidative stress and starvation, 
discuss the lipid droplet antioxidant mechanism, and speculate 
upon its wider implications. 

Niche Glia Protect Neuroblasts from Oxidative Stress 
and Nutrient Deprivation 

For both NR and oxidative stress, the protection of neuroblast 
divisions involves niche glia. We previously showed that neural 
growth and proliferation are spared during NR by constitutive 
glial secretion of Jeb, a ligand for Alk/PI3-kinase signaling in neu- 
roblasts (Cheng et al., 2011). In the present study, we find that 
neuroblast proliferation is also spared during oxidative stress 
and that this requires the glial activity of proteins involved in 
TAG metabolism such as DGAT1 and Lsd-2. These enzymes 
promote the accumulation of PUFA and other fatty acids as lipid 
droplet TAGS in cortex and subperineurial glia. TAG synthesis is 
required in these niche glia for sustaining neuroblast proliferation 
during oxidative stress but this is not the case during NR or fed 
conditions. Hence, although niche glia are critical for protecting 
neighboring neuroblasts from nutrient deprivation and from 
oxidative stress, in each case a different molecular mechanism 
is involved. Presumably both glial mechanisms work together 
to give neuroblasts the remarkable ability to continue dividing 
even when oxidative stress is combined simultaneously with NR. 

A High PUFA Diet Increases the Vulnerability of 
Neuroblasts to ROS Damage 

We found that a high PUFA diet makes neural stem cells and their 
glial niche more vulnerable to ROS-induced damage. Omega-3 
and omega-6 PUFAs are obtained from the diet as both 
Drosophila and humans lack the key desaturases required to 



Figure 7. Lipid Droplets Protect Dividing Neuroblasts from PUFA Peroxidation 

(A) Lipid peroxidation following low-dose (10 mM) tBH treatment for 20 hr of larvae raised on a 20 mM linoleate (Cl 8:2) diet. Co-incubation with the PUFA 
peroxidation sensor (C11-BODIPY 581/591) and LipidTOX 633 shows an increase in the ratio of oxidized: non-oxidized sensor in neuroblast lineages (dotted 
outlines) and also in glia following the knockdown of lipid droplets {repo > Lsd-2 RNAi, control genotype: repo > w”^®). With repo > there is a low per- 
oxidation ratio in LipidTOX'^ droplets (arrow), but, with repo > Lsd-2 RNAi, there is a high peroxidation ratio in rare abnormal LipidTOX^ puncta (arrow). The same 
two genotypes are used in (B)-(E). Scale bars in this and subsequent panels, 10 ^im. 

(B) Glial lipid droplet knockdown increases 4-HNE protein adducts in many cells of the CNS, including neuroblasts (marked with anti-Miranda), neurons (marked 
with anti-Neurotactin), and glia (marked with anti-Repo). 

(C) Quantification of 4-HNE protein adducts in neuroblasts. Glial lipid droplet knockdown in the presence of 10 mM tBH (+ tBH), but not in NR controls (-tBH), 
leads to a significant increase in 4-HNE protein adducts. This is further increased when larvae are raised on a 20 mM linoleate diet. Mann-Whitney test: gray 
p > 0.1 , green p < 0.001 relative to the appropriate genotype control. 

(D) ROS increase following glial lipid droplet knockdown. Oxidized DHE levels increase significantly throughout the CNS (histogram) with elevated levels in glia 
(blue arrows) and in neuroblasts (white arrows). 

(E) Neuroblast proliferation is significantly inhibited by 1 0 mM tBH if glial lipid droplets are knocked down. Inhibition is stronger when larvae are raised on a 20 mM 
linoleate diet and can be significantly rescued by dietary supplementation with 40 |ag/ml AD4. In the control genotype, tBH and 20 mM dietary linoleate do not 
significantly decrease neuroblast proliferation. Pooled data are represented as mean + SD. 

(F) PUFA protection model for the antioxidant role of lipid droplets. The neural stem cell niche (glia) and the neural stem cell (neuroblast) are depicted in the 
presence of oxidative stress, in a CNS that is wild-type (left) or RNAi knockdown for a glial lipid droplet gene (right). Oxidative stress stimulates the biogenesis of 
glial lipid droplets that protect vulnerable PUFAs in the TAG core from ROS-induced peroxidation. Peroxidation of omega-6 PUFAs produces 4-HNE, which, in 
turn, can generate more ROS, leading to PUFA chain reactions that damage membrane lipids and other macromolecules such as proteins. Following oxidative 
stress, the proportion of total PUFAs and other fatty acids increases in the TAG pool, relative to the phosphatidylcholine pool. This may correspond to bulk 
redistribution of lipids from membranes to lipid droplets, which would decrease the amount of membranes and thus minimize the fraction of PUFAs that are 
susceptible to ROS-induced peroxidation. The mechanism by which ROS and/or 4-HNE inhibit stem cell proliferation and the molecule(s) communicating 
between the niche and the stem cell are not yet clear (dotted arrows). 

See Discussion for details. See also Figure S6. 
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synthesize them. These PUFAs are essential fatty acids for 
humans but they do not appear to be required for Drosophila 
survival (Rapport et al., 1983). Larvae do, however, consume 
significant quantities of the omega-6 PUFA linoleic acid and 
increasing its concentration in the diet makes neuroblasts and 
other developing neural cells more susceptible to damage from 
ROS-induced lipid peroxidation chain reactions. Glial lipid drop- 
lets make an important contribution toward minimizing this ROS 
damage in the CNS by storing linoleic acid and protecting it from 
peroxidation. Saturated and monounsaturated fatty acids are 
much less vulnerable than PUFAs to peroxidation, but they 
too accumulate in glial lipid droplets during oxidative stress. 
Whether these fatty acids contribute to the protection of PUFAs 
in lipid droplets is not yet clear but, in the membranes of cancer 
cells, they are thought to decrease ROS damage by diluting 
PUFAs in the phospholipid pool (Rysman et al., 2010). It may 
also be important to avoid excess saturated fatty acids in mem- 
brane phospholipids, and the scavenging of extracellular mono- 
unsaturated lysophospholipids provides one way of preventing 
this (Kamphorst et al., 2013). 

An Antioxidant Role for the Lipid Droplet 

It is becoming increasingly clear that lipid droplets mediate 
cellular functions other than fat storage and mobilization relevant 
for energy homeostasis. For example, lipid droplets can partici- 
pate in protein degradation, histone storage, viral replication, 
and antibacterial defense (Anand et al., 2012; Walther and 
Farese, 2012). Our study now reveals an additional role for lipid 
droplets as an antioxidant organelle, defending membrane 
PUFAs from damage by ROS-induced peroxidation. We also 
found that lipid droplets play a wider antioxidant role, protecting 
cellular proteins from reactive peroxidation products such as 
4-HNE. The results presented here support a PUFA protection 
model for the antioxidant mechanism of lipid droplets (Figure 7F). 
Central to this model are five key findings. First, DGAT1 and 
Lsd-2 knockdowns and AD4 rescues show that glial lipid drop- 
lets act during oxidative stress to minimize ROS, PUFA peroxi- 
dation, and 4-HNE damage and also to sustain neuroblast prolif- 
eration. Second, lipid peroxidation sensor and tandem mass 
spectrometry experiments argue that the major diet-derived 
PUFA, linoleic acid, is less vulnerable to peroxidation in lipid 
droplet TAGS than in membrane phospholipids. Third, Catalase 
or SOD2 overexpression or NAC supplementation provides evi- 
dence that ROS (directly or indirectly) induce lipid droplets dur- 
ing oxidative stress. Fourth, MIMS, tandem mass spectrometry, 
PLD knockdown, and CNS explants demonstrate that saturated, 
monounsaturated, and polyunsaturated fatty acids redistribute 
from membrane phospholipids to lipid droplet TAGs during 
oxidative stress. And fifth, in vitro and in vivo experiments 
together show that pro-oxidants not only stimulate widespread 
ROS and PUFA peroxidation but also that 4-HNE, a product of 
PUFA peroxidation, is itself capable of increasing ROS, particu- 
larly in neuroblasts. This identifies a positive feedback loop in the 
CNS between ROS, PUFA peroxidation, and 4-HNE but also 
suggests that it will be difficult to pinpoint the molecule(s) prop- 
agating oxidative damage between glia and neuroblasts. 

Our findings raise the important issue of why PUFAs are more 
efficiently protected from peroxidation in the core of lipid droplets 



than in membranes. Future biophysical investigations will be 
needed to resolve this but it is likely that lipid droplets provide a 
means to segregate reactive PUFAs away from aqueous pro-ox- 
idants and lipid peroxidation chain reactions at membrane- 
aqueous interfaces. Studies of oil-and-water emulsions indicate 
that TAGs in small lipid droplets are less mobile and thus better 
protected from peroxidation chain reactions than those in larger 
lipid droplets (Nakaya et al., 2005). Interestingly, lipid droplets in 
glia are smaller, on average, than those found in the major lipid 
storage depot of the fat body. Given this, and that their biogenesis 
in glia requires DGAT 1 , it is interesting that a previous study found 
that DGAT1 and DGAT2 are involved in the formation of small 
versus large lipid droplets, respectively (Wilfling et al., 2013). 

Lipid droplets have been observed in many different cell types 
challenged with hypoxia, ischemia, or various metabolic imbal- 
ances. For example, mitochondrial protein knockdowns in 
Drosophila photoreceptor neurons generate ROS and induce 
lipid droplets in neighboring pigment and epithelial glia (Liu 
et al., 2015). In this mutant context, either neuronal or glial-spe- 
cific overexpression of lipases can decrease lipid droplets and 
ameliorate photoreceptor degeneration. In a mammalian 
example, hypoxia induces lipid droplets in glioblastoma and 
breast cancer cells via a pathway involving HIF-1a (Bensaad 
et al., 2014). This hypoxic induction mechanism in cancer cells 
appears different from the HIF-1 independent pathway that we 
have identified in the Drosophila neural stem cell niche. It is 
nevertheless intriguing that, in both contexts, inhibiting lipid 
droplet formation increases ROS toxicity and impairs cell prolif- 
eration. It will therefore be important in future to determine 
whether lipid droplets also play antioxidant roles in other biolog- 
ical contexts. 

EXPERIMENTAL PROCEDURES 

Larval Environmental Manipulations and EdU Labeiing 

Larvae at 0-6 hr after larval hatching (ALH) were raised, 20 per vial, at 25°C on 
standard corn meal/yeast/agar food (composition in Gutierrez et al., 2007) until 
mid-third instar (66-72 hr ALH), then subjected to environmental stresses. For 
NR, larvae were floated from standard diet using 30% glycerol/PBS and then 
transferred to 0.5% low melting-point agarose. N-acetyl cysteine (pH 7.2) and/ 
or pro-oxidants were added to agarose at 38°C, before cooling. Stearic or 
linoleic acid (Sigma-Aldrich) was supplemented to 20 mM in standard food. 
Hypoxia used an O 2 Cabinet (Coy Lab Products) calibrated to 0% and 
20.9% oxygen, with 2.5% oxygen maintained by automated nitrogen injection. 
Intermittent hypoxia used alternate injections of nitrogen and air over multiple 
cycles. For the in vitro EdU assay, dissected tissues were incubated at 25°C 
for 1 hr in PBS with 10 |am EdU. For the in vivo EdU assay, larvae were trans- 
ferred to 0.5% low melting-point agarose with 100 |am EdU for 3 hr and 
stresses then applied in the presence of EdU for a further 19 hr. 

Tissue Imaging and Anaiysis 

Tissues were fixed and stained for EdU with the Click-iT EdU Alexa Fluor 555 
Imaging Kit (Molecular Probes) and for neutral lipids using oil red O or LipidTOX 
(Molecular Probes). Live ex vivo imaging was used to detect oxidized DHE 
(Invitrogen) and peroxidated lipid (C11-BODIPY 581/591, Invitrogen). See 
Supplemental Experimental Procedures for details. 

Lipid Anaiysis by MIMS, Tandem MS, and GC-MS 

For Multi-Isotope Imaging Mass Spectrometry (MIMS), larvae were fed diets 
containing 1-^^C Glucose, 1-^^C acetate, or ^^C-U-linoleic acid (Cambridge 
Isotope Laboratories), and after hypoxia (2.5% oxygen for 22 hr) CNSs 
were dissected, fixed, embedded in Technovit 7100 (Heraeus Kulzer), and 
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sectioned at 2 lam. Analysis was performed on 30- to 50-|am regions of interest 
with simultaneous imaging of multiple ions, including and 

Tandem mass spectrometry (nanospray infusion) and gas chromatography 
mass spectrometry were performed on lipid extracts of 6-15 CNSs. See Sup- 
plemental Experimental Procedures for details. 

SUPPLEMENTAL INFORMATION 

Supplemental Information includes Supplemental Experimental Procedures, 
six figures, and one movie and can be found with this article online at http:// 
dx.doi.org/1 0. 1 01 6/j.cell.201 5.09.020. 
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SUMMARY 

Infections have been proposed as initiating factors 
for inflammatory disorders; however, identifying as- 
sociations between defined infectious agents and 
the initiation of chronic disease has remained 
elusive. Here, we report that a single acute infection 
can have dramatic and long-term consequences for 
tissue-specific immunity. Following clearance of 
Yersinia pseudotuberculosis, sustained inflamma- 
tion and associated lymphatic leakage in the 
mesenteric adipose tissue deviates migratory den- 
dritic cells to the adipose compartment, thereby 
preventing their accumulation in the mesenteric 
lymph node. As a consequence, canonical mucosal 
immune functions, including tolerance and protec- 
tive immunity, are persistently compromised. 
Post-resolution of infection, signals derived from 
the microbiota maintain inflammatory mesentery 
remodeling and consequently, transient ablation 
of the microbiota restores mucosal immunity. 
Our results indicate that persistent disruption of 
communication between tissues and the immune 
system following clearance of an acute infection 
represents an inflection point beyond which tissue 
homeostasis and immunity is compromised for 
the long-term. 

354 Cell 163, 354-366, October 8, 2015 ©2015 Elsevier Inc. 



INTRODUCTION 

The immune system deals with highly diverse infectious chal- 
lenges in a manner that both promotes the control of invading 
agents and restores tissue homeostasis. At barrier tissues, the 
sites of constitutive microbial exposure, tissue-specific immunity 
requires defined structural components that provide spatial 
segregation from the microbiota to ensure maintenance of organ 
function and enable the balance between tolerance to environ- 
mental antigens and regulated protective immunity (Mowat and 
Agace, 2014). These balanced processes are essential for the 
preservation of tissue integrity. Inflammatory diseases affecting 
barrier sites develop as a result of a breakdown in tissue homeo- 
stasis and the subsequent failure to regulate immune responses 
to environmental or microbial antigens (Belkaid and Segre, 201 4; 
Maloy and Powrie, 2011). 

Acute infections represent frequent and violent tissue pertur- 
bations from which the immune system must rapidly rebound. 
It is estimated that >70% of all people in the United States expe- 
rience a respiratory tract infection each year, and the average 
child will suffer ten diarrheal episodes before the age of 5, a num- 
ber that is dramatically higher in low to middle income countries 
(Fendrick et al., 2003; Kosek et al., 2003; Vernacchio et al., 2006). 
At barrier sites in particular, acute infections represent highly 
volatile situations, with pathogens, commensals, and environ- 
mental antigens transiently sharing the same inflamed environ- 
ment. As a consequence, in the gastrointestinal (Gl) tract, acute 
mucosal infections are characterized by dysbiosis associated 
with significant shifts in the microbiota and dominance of 
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bacteria with enhanced invasive capabilities and other proper- 
ties that can directly exacerbate inflammation and tissue dam- 
age (Belkaid and Hand, 2014). Because regulatory responses 
are often transiently neutralized during acute infection, these 
encounters can also blur the distinction between benign constit- 
uents and pathogenic organisms (DePaolo et al., 2011; Hand 
et al., 2012; Oldenhove et al., 2009). What remains unclear is 
whether acute infections produce persistent changes that nega- 
tively impact immune function, and if so, what are the mecha- 
nisms of such effects? 

These questions are especially compelling in light of the 
rising incidence of inflammatory disorders, particularly involving 
those diseases affecting barrier tissues such as inflammatory 
bowel disease, psoriasis, allergy, and asthma (Molodecky 
et al., 2012; Salgame et al., 2013). A number of environmental 
stressors, including infections have been invoked as possible 
triggers. Chronic infections in particular can have significant 
bystander consequences on the capacity of the host to respond 
to subsequent challenges (Barton et al., 2007; Cadwell et al., 
2010; Jamieson et al., 2013; Osborne et al., 2014; Salgame 
et al., 2013). However, identifying direct associations between 
defined infectious agents and the initiation of chronic disease 
has remained difficult. One possible explanation for this difficulty 
in establishing cause-effect relationships could be that infections 
associated with the breakdown of tissue homeostasis may not 
be temporally coincident with the eruption of symptomatic dis- 
ease, and previously cleared infections could have long-term 
and cumulative effects on the immune system. 

Here, we show that a single transient encounter with a Gl 
pathogen. Yersinia pseudotubercuiosis, can have dramatic 
and persistent consequences on tissue-specific immunity. 
Infection-induced remodeling of the mesentery and increased 
leakage of lymphatic vessels persistently interrupts the immune 
dialog between the gut and associated lymphoid tissue. These 
sequelae of a cured infection interfere with the capacity of the 
host to develop both tolerance and adaptive immunity to oral 
antigens and are sustained by the microbiota. Thus, in defined 
infectious settings, the immune system can reach an inflection 
point beyond which restoration of tissue homeostasis and immu- 
nity requires intervention. These results reveal that acute infec- 
tions could impose “immunological scarring” that may have 
cumulative and long-term consequences. This work provides a 
framework for understanding how previously experienced infec- 
tious stressors that are no longer present in the host can cause 
localized immune damage that may contribute to the breakdown 
of tissue-specific immunity and the emergence of complex 
diseases. 

RESULTS 

Oral Infection with Yersinia pseudotubercuiosis Induces 
Chronic Mesenteric Lymphadenopathy 

Yersinia pseudotubercuiosis is a foodborne Gram-negative bac- 
terium that, in humans, can cause a range of Gl syndromes from 
acute enteritis to mesenteric lymphadenitis and pseudoappendi- 
citis (Asano, 2012; Wren, 2003). In mice, oral infection with 10^ 
colony-forming units (CPU) of Y. pseudotubercuiosis (IP32777) 
caused a transient infection, with the peak of bacterial burden 



at day 7 post-infection (Figure 1 A) (McPhee et al., 2012; Simonet 
and Falkow, 1992). As previously described (Fahlgren et al., 
201 4), systemic control of the infection was achieved by 3 weeks 
post-infection in most animals (Figures 1A and SI A). T cell re- 
sponses are important for the control of Y. pseudotubercuiosis 
infection (Bergman et al., 2009; Bergsbaken and Bevan, 2015; 
Zhang et al., 2012) and the number of IFN-y-producing anti- 
gen-specific (YopE 69 - 77 ) CD8^ T cells increased significantly 
by 2 weeks post-infection before contracting, in concert with 
clearance of the bacteria (Figures SI B and SIC). 

Concurrent with the peak of bacterial burden, 
Y. pseudotubercuiosis induced a rapid influx of neutrophils and 
monocytes into all infected tissues (Figures 1 B-1 D and data not 
shown). This increase in phagocytic cells largely subsides in all 
compartments, except for the MLN (Figures 1B-1D and data 
not shown). Despite efficient control of the bacteria, a significant 
fraction of infected mice (70%) developed chronic mesenteric 
lymphadenopathy (CL"") defined by 3- to 4-fold tissue enlarge- 
ment, the formation of central abscesses, the presence of foamy 
macrophages, and significant collagen deposition (Figures 1E- 
1H and SID). All of these features are reminiscent of the MLN 
pathology that can be observed during human infections with 
Y. pseudotubercuiosis (Asano, 2012). Lymphadenopathy was 
highly restricted to the MLN and still detectable 9 months post- 
resolution of the infection (Figures 1 E and 1 1). This response 
was not associated with persistent infection of the gut, MLN, 
and mesenteric adipose tissue (MAT) (Figures 1 A and SI A). How- 
ever, CL'^ lymph nodes were not sterile and bacteria (the vast ma- 
jority belonging to the genus Lactobacillus) could be grown post- 
resolution of the infection (Figures S1E and S1F). Thus, acute 
infection with Y. pseudotubercuiosis can induce long-term and 
localized damage to gut-associated lymphoid structures. 

Impaired Tissue-Specific Adaptive Immunity Post 
y. pseudotubercuiosis Infection 

The profound disruption of MLN structure following 
Y. pseudotubercuiosis infection led us to explore the possibility 
that the homeostatic immune dialog between mucosal antigens 
and the gut-associated secondary lymphoid tissue may be 
compromised. To address this point, we first explored the poten- 
tial impact of lymphoid tissue remodeling following infection on 
the acquisition of oral tolerance, a process that is largely regu- 
lated by the induction of antigen-specific peripheral Treg cells 
(pTreg) (Coombes et al., 2007; Hadis et al., 2011; Sun et al., 
2007). To this end, T cells from Rag1~'~ OT-II TCR-transgenic 
mice were transferred into mice fed with ovalbumin (OVA). Con- 
trols included naive hosts as well as mice that controlled 
Y. pseudotubercuiosis without developing lymphadenopathy 
(CL“). In naive OVA-fed mice, a significant fraction of OT-II cells 
accumulating in the GALT expressed Foxp3 (Figures 2A and 2B). 
Furthermore, >30% of pTreg cells co-expressed GATA3 (Fig- 
ure 2A), a transcription factor associated with Treg fitness 
(Wang et al., 201 1 ; Wohifert et al., 2011). In contrast, pTreg gen- 
eration was significantly impaired in mice harboring enlarged 
MLNs (CL^) compared to naive or CL“ mice (Figures 2A and 
2B). The percentage of Treg cells expressing GATA3 was also 
significantly decreased in CL"^ compared to control mice (Figures 
2A and 2B). Acquisition of tolerance to orally fed antigens can be 
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Figure 1 . Oral Infection with Y. pseudotuberculosis Induces Persistent Mesenteric Lymphadenopathy 

C57BL/6 mice were orally infected with Y. pseudotuberculosis (YP). 

(A) Infectious burden in the spleen, MLN, and liver at indicated time points. 

(B and C) Numbers of neutrophils and inflammatory monocytes from siLP, spleen, liver, and MLN. 

(D) Representative flow cytometric contour plots indicating the percentage of neutrophils in the MLNs of naive and infected mice. 

(E) Images on the left depict the gastrointestinal tract, mesenteric adipose tissue (MAT), and MLN (dotted lines) from naive and infected mice. Images on the right 
show (from top) the axillary, brachial, lumbar, inguinal, and mesenteric lymph nodes. 

(F) Weight of MLN from naive (white circles) or infected mice with chronic lymphadenopathy (CL'^) (black circles) and without lymphadenopathy (CL“) (gray 
circles). 

(G) Compilation of 15 separate experiments involving week 4 to week 9 infected mice showing the percent of animals with lymphadenopathy. 

(H) Histology of naive and CL"^ MLNs stained with picrosirius red. Large arrows show abscesses, arrowheads indicate areas of collagen deposition. 

(I) Number of neutrophils in the MLN at day 300 post-infection. All bar graphs show the mean ± SEM. All data shown, except (I) and (G), is representative of three to 
six experiments, each containing three to five naive and six to ten infected animals. Data represented in (I) are representative of 2 experiments with three naive and 
three to nine infected animals. *p < 0.05, **p < 0.005 compared to naive mice (Student’s t test). 

See also Figure SI . 



measured by assessing immune responses following challenge 
at peripheral sites (Curotto de Lafaille et al., 2008; Weiner 
et al., 2011; Worbs et al., 2006). In contrast to naive mice, CL^ 
mice fed with OVA failed to acquire oral tolerance and developed 
a delayed type hypersensitivity response to OVA challenge (Fig- 
ure 20). Thus, mesenteric lymphadenopathy induced by infec- 
tion is associated with impaired acquisition of oral tolerance to 
food antigens. 

We next assessed whether lymphadenopathy could poten- 
tially interfere with the induction of mucosal effector responses. 



To this end, we utilized a model of oral vaccination combining 
a mixture of OVA and an attenuated double mutant of the 
heat-labile enterotoxin of enterotoxigenic Escherichia coii (LT 
R192G/L211A; hereafter referred to as dmLT) (Hall et al., 
2008). As previously shown, this regimen induces robust OVA 
and dmLT-specific Th17 cells (Hall et al., 2008) (Figures 2D 
and 2E). The induction of Th17 cells specific for dmLT or 
OVA was dramatically impaired in CL"^ mice compared to 
controls (Figures 2D and 2E). Induction of dmLT-specific and 
OVA-specific IgA was also significantly reduced in CL"^ mice 
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Figure 2. Infection-Induced Mesenteric Lymphadenopathy Is Associated with Disruption of Mucosal Immunity 

(A and B) Naive or 4-week infected C57BL/6 (CD45.2) mice were transferred with CD45.1^ Rag1~'~ OT-ii TCP transgenic T ceiis (CD45.1), fed ovaibumin 
(OVA) in the drinking water, and ceiis from the siLP and MLNs were isoiated. (A) Representative flow cytometric contour piots of Foxp3 and GATA3 expression 
by OT-ii T ceiis isoiated from the siLP of naive and infected mice with or without iymphadenopathy (YP CL'^ and YP CL“, respectiveiy). Numbers represent 
the mean percentage within the gate (±SEM) of aii sampies in this experiment. (B) Numbers of OTii Foxp3^ and OT-ii Foxp3^GATA3^ T ceiis in the siLP 
and MLN. 

(0) Top: scheme for induction of orai toierance. CFA, compiete Freund’s adjuvant. Bottom: footpad sweiiing was measured in the feet of sensitized mice after 
chaiienge with OVA. 

(D-F) Naive, YP CL“ and YP CL"^ mice were immunized oraiiy with OVA and doubie mutant heat iabiie toxin (dmLT). (D and E) Seven days after immunization, 
lymphocytes were isolated from the siLP and stimuiated in vitro with DCs puised with dmLT or OVA to measure T ceii responses. (D) Fiow cytometric 
contour piots show representative popuiations of antigen-specific iFN-y and iL-17A-producing CD4'^ T ceiis from naive, vaccinated (CtriA/ax), or 
Y. psetvc/oftvbe/'ctv/os/s- infected vaccinated (YP CL^^“A/ax) mice. Numbers in piots represent the mean frequency of ceiis within the adjacent gate (±SEM) of aii 
sampies within this experiment. (E) Numbers of iL-17A-producing CD4'^ T ceiis from (D). (F) Measurement of dmLT and OVA-specific fecai igA by ELiSA. OD, 
opticai density. 

(legend continued on next page) 
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compared to vaccinated controls (Figure 2F). Infection with 
Y. pseudotuberculosis also modestly modifies the fecal and 
small intestinal composition of the microbiota (Figures 2G and 
2H). Of note, no differences were found between CL"^ versus 
CL“ mice (Figure 2H). 

Together, these results reveal that a single acute infection can 
chronically alter the ability of the immune system to develop ca- 
nonical mucosal responses necessary for maintenance of gut 
immune homeostasis. 

Chronic Mesenteric Lymphadenopathy Is Associated 
with Defects in Dendritic Cell Migration 

Effective activity of the mucosal immune system relies upon effi- 
cient trafficking of DCs to the MLN to ensure the maintenance of 
homeostasis with respect to antigens acquired from the intesti- 
nal lumen (Hooper and Macpherson, 2010). We first examined 
the spatial organization of the MLN following infection with 
Y. pseudotuberculosis. Consistent with flow cytometry analysis 
(Figures 1B-1D), confocal imaging revealed a significant accu- 
mulation of LysM-eGFP"^ cells at the center of the MLN (Fig- 
ure 3A). The central T cell zone was displaced and distributed 
in a disorganized manner at the periphery of the MLN of CL'^ (Fig- 
ures 3A and S2A). Additionally, B cell follicles and lymphatic ves- 
sels that, in naive mice, localize at the periphery of the MLN, were 
located at the edge of the abscess (Figure 3A). Naive T cells were 
still present in the MLN post-infection, suggesting that some 
MLN functions, such as the trafficking of lymphocytes from the 
blood, may have been preserved (Figure S2B). 

In the gut, immune dialog is mediated bythree main antigen pre- 
senting cells that can be segregated based on their expression of 
integrin aE (GDI 03) and aM (GD1 1 b) (Figure S2C) (Bekiaris et al., 
2014; Bogunovic et al., 2009; Grainger et al., 2014). In particular, 
the migratory GDI 03-expressing DC subsets have been associ- 
ated with the induction of Th17 and Treg cells (Coombes et al., 
2007; Satpathy et al., 2013; Schlitzer et al., 2013; Schulz et al., 
2009; Sun et al., 2007; Varol et al., 2009). The frequency and abso- 
lute number of migratory GDI 03^CD1 1 b^ DCs were dramatically 
reduced in the MLN of CL^ mice compared to naive and CL“ mice 
(Figures 3B and 3C). GDI 03'^CD1 1 b“ DCs in the MLN were also 
reduced, albeit to a lesser degree (Figures 3B and 3C). For the 
remainder of this study, we focused on the migratory 
CD103^CD1 1 b^ DC subset, whose unique phenotype allows for 
faithful tracking during inflammation. Importantly, the defect in 
CD103^CD11b^ DCs was restricted to the MLN as gut lamina 
propria DC subsets were unchanged in CL^ mice versus controls 
(Figure 3D). These results support the idea that lack of migratory 
CD103^CD1 1 b^ DCs in the MLN was not the consequence of a 
defect in DC development or gut homing, but the result of impaired 
migration and/or accumulation in the MLN. 

Extraction of cells from the MLNs of CL^ mice could be biased 
because of the high level of collagen deposition (Figure 1H). 
Histo-cytometry allows for the identification, spatial positioning. 



and quantitative measurement of cells within their host tissue 
(Gerner et al., 2012). Because GDI 03 is highly sensitive to fixa- 
tion, we devised an alternative gating strategy to identify 
CD103^CD11b'^ DCs by histo-cytometry (Figures S3A and 
S3B) (Merad et al., 2013). As previously described (Gerner 
et al., 2012; Kissenpfennig et al., 2005), migratory and resident 
DC subsets localized in distinct regions of naive LNs (Figures 
3E, 3G, and S3C). This approach allowed us to describe the 
localization of the migratory CD103'^CD1 1 b^ DCs in the MLN 
under steady-state conditions as positioned peripheral to 
CD103^CD11b“ DCs on the edge of the putative T cell zone 
and in close proximity to the B cell follicles (Figures 3E and 
3G). Following infection, the CD103‘^CD1 1 b^ DCs were virtually 
absent from CL"^ MLNs compared to controls (Figures 3E-3G, 
S3C, and S3D). The distribution of other DC subsets was not 
altered (Figures 3E, S3C, and S3D). Thus, two complementary 
approaches revealed a sustained defect in the accumulation of 
migratory DCs in the MLN post-infection. 

Acute Infection Alters Lymphatic Integrity and 
Mesenteric Adipose Tissue Homeostasis 

Notwithstanding the direct infection-related structural disruption 
of the MLN, our results thus far support the idea that impaired 
mucosal immunity following infection may be a consequence of 
a selective defect in the capacity of migratory DCs to access the 
MLN. In addition to cell trafficking, the gut-associated lymphatic 
system is also responsible for the transport of lipids. We assessed 
the physical integrity of gut-draining lymphatic vessels by orally 
administering the fluorescent long-chain fatty acid, Bodipy FL 
C16 (Khalifeh-Soltani et al., 2014; Randolph and Miller, 2014). In 
contrast to control mice, Bodipy FL Cl 6 leaked into the surround- 
ing mesenteric adipose tissue (MAT) of CL"^ mice, revealing a ma- 
jor defect in lymphatic containment, a defect that was sustained 
up to 1 0 months post-infection (Figures 4A and S4A). 

We next assessed how the loss of lymphatic integrity could 
impact MAT immune homeostasis. Imaging of the MAT revealed 
a massive infiltrate of hematopoietic cells in CL"^, but not CL“ 
mice (Figure 4B). This infiltrate was associated with a significant 
increase in neutrophils and macrophages while the number of 
type 2 immune cells including eosinophils and ILC2s, was signif- 
icantly reduced (Figure 4C). The MAT shifted toward type 1 immu- 
nity, with a significant increase in the transcription of inflamma- 
tory mediators, including, Ifng, 111 a, 111b, Tnfa, 1112, Fpri , Fpr2, 
antimicrobial molecules, and chemokines (Figures 4D and S4B; 
Table SI). Alteration of lymphatic integrity was not unique to 
Y. pseudotuberculosis, as is demonstrated by leakage in models 
of T cell transfer colitis and Toxoplasma gondii infection (Figures 
S4C and S4D). 

At steady state, the MAT contains CD103^CD11b“ and 
CD103“CD11b-^ DCs, while CD103-^CD1 1 b^ DCs are largely 
absent from this compartment (Figures 5A, 5B, and S4E). In 
contrast, the frequency and number of CD103'^CD1 1 b"^ DCs 



(G) Bar graphs showing the fraction of the fecai microbiota represented by individuai operationai taxonomic units (OTUs) identified by 16S bacteriai gene 
sequencing at the time points indicated. 

(H) Principal coordinate analysis of 1 6S gene sequencing data derived from fecal and small intestinal samples (weighted UniFrac). All experiments, except (G) and 
(H), are representative of three to five separate experiments containing five control mice and five to ten infected mice per group. All graphs show the mean ± SEM. 
*p < 0.05, **p < 0.005, ***p < 0.0005 (Student’s t test). 
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Figure 3. Mesenteric Lymphadenopathy Is Associated with a Reduction in Migratory DCs in the MLN 

(A) LysM-eGFP reporter mice were infected with Y. pseudotuberculosis. At 4 weeks post-infection, sections of naive or CL"^ MLNs were analyzed by confocal 
microscopy. 

(B-D) Bar graphs show the numbers of each DC subset (gated as described in Figure S2C), isolated from the MLN (B) and siLP (D) at the time points indicated. (C) 
Representative flow cytometric contour plots of DC subsets from the MLN and siLP. Numbers in plot represent the mean frequency of cells within the adjacent 
gate (±SEM) of all samples within this experiment. 

(E-G) MLN sections from naive or 4-week infected CX3CRI-GFP reporter mice were stained for analysis by multiparameter confocal microscopy. (E) Three- 
dimensional “Cell-of-Interest” surfaces were generated based on CD11c expression and quantitatively analyzed by histo-cytometry (Figure S3) to identify the X 
and Y positions of individual cells within the antigen-presenting cell subsets (labels below image). (F) Representative histograms obtained from the histo- 
cytometric analysis indicating the expression of CD1 1 b^ in cells occupying the CD1 1 MHC ll'^'^'^ CX3CRI “ CD64“ gate (Figure S3). (G) Representative confocal 
image of MLN sections stained for Collagen IV and CD8a overlaid with the position of CD11c^ MHC ll'^'^'^ CX3CR1“ CD64“ CD11b^ DCs (analogous to 
CD1 1 bXD103'^ DCs) reconstructed by histo-cytometry. Dotted blue lines represent the B cell zones and red lines indicate the T cell zones. Images in (A), (E), and 
(G) are representative of two separate experiments. Flow cytometry data from (B)-(D) are representative of five experiments each containing five control animals 
and five to ten infected animals. All bar graphs show the mean ± (SEM). *p < 0.05, **p < 0.005, ***p < 0.0005 (Student’s t test). CL'^, infected mice with 
lymphadenopathy; CL“, infected mice without lymphadenopathy. 

See also Figures S2 and S3. 
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Figure 4. Yersinia pseudotubercuiosis Infection Is Associated with Lymphatic Leakage and Mesenteric Adipose Tissue Remodeling 

(A) Naive or 4-week infected mice were gavaged with Bodipy FL C16. Shown are representative fluorescent microscope images of the MAT (top) or high- 
magnification confocal images of individual MAT lymphatics overlaid with the bright field (bottom). 

(B) MAT samples from naive or infected CX3CRI-GFP mice were stained and imaged by confocal microscopy. 

(C) Numbers of ILC2s, eosinophils, neutrophils, and macrophages in the MAT determined by flow cytometry. 

(D) Heat map of gene expression from CD45^ cells isolated from the MAT at the indicated time points and analyzed by NanoString technology. Images in (A) and 
(B) are representative of three experiments. Data from (C) are representative of three to five experiments each containing five control animals and five to ten 
infected animals. Data in (D) are representative of two separate experiments. All bar graphs show the mean (±SEM). *p < 0.05, **p < 0.005, ***p < 0.0005 (Student’s 
t test). CL"^, infected mice with lymphadenopathy; CL“, infected mice without lymphadenopathy. 

See also Figure S4 and Table S1 . 



dramatically increased in the MAT post-infection, an effect that 
persisted for at least 1 0 months post-infection (Figures 5A and 
5B). We next examined whether migratory DCs had extravasated 
into the adipose tissue by confocal microscopy. While, in control 



mice few CD1 1 c'^ and/or CD1 1 cells resided in the adipose tis- 
sue, in CL'^ mice, antigen-presenting cells accumulated in the 
MAT particularly in areas surrounding the lymphatic vessels (Fig- 
ures 5C, 5D, and S4F). These results support the idea that 
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Figure 5. CD103^CD11b^ DCs Accumulate 
in the MAT following Infection with Yersinia 
pseudotubercuiosis 

(A) DC subsets in the MAT as characterized by 
flow cytometry, according to the gate strategy 
described in Figure S4E. Shown are representative 
contour plots of MAT DC subsets from naive or 
infected animals (CL^^“). Numbers in plot indicate 
the mean frequency of cells (±SEM) within the 
adjacent gate from all samples in this experiment. 
Bar graph shows the number of CD103^CD11b^ 
DCs in the MAT. 

(B) Bar graphs show the frequency and number 
of CD103^CD11b^ DCs in the MAT at 42 weeks 
post-infection. 

(C) Representative whole tissue fluorescent im- 
ages of MAT from naive or 4-week infected 
CD11C-YFP reporter mice. 

(D) Representative confocal images describing 
the localization of CD11c^CD11b^MHC ir cells in 
relation to lymphatics (LYVEI"^) in the MAT. All the 
data are representative of three experiments each 
containing five control animals and five to ten in- 
fected animals. All bar graphs show the mean 
(+SEM). *p < 0.05, ***p < 0.0005 (Student’s t test). 
CL^, infected mice with lymphadenopathy; CL“, 
infected mice without lymphadenopathy. 

See also Figure S4. 
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following infection, migratory DCs prematurely escape the 
lymphatic flow and are shunted to the adipose tissue. 

Microbiota Sustains Impaired Tissue-Specific Immunity 
following Infection 

We next explored the possibility that the resident microbiota 
might sustain tissue remodeling and inflammation following res- 
olution of the infection. To address this possibility, we assessed 
the consequences of Y. pseudotuberculosis infection in mice 
devoid of live microbes (germ-free [GF]). Like specific path- 
ogen-free (SPF) mice, GF mice cleared the bacteria from the 
MLN, liver, and spleen, but, in contrast to SPF mice, not in the 
Gl tract (Figures S5A and S5B). This observation is consistent 
with the known role for the microbiota as an agent of colonization 
resistance (Buffie and Pamer, 2013). GF mice still developed 
lymphadenopathy, albeit to a lesser magnitude than in SPF 
mice, revealing that the microbiota was not strictly required for 
infection-induced remodeling of the MLN (Figure 6A). On the 
other hand, Y. pseudotuberculosis had no impact on the 
capacity of CDIOS^CDIIb^ DCs to accumulate in the MLN of 
GF mice, highlighting a central role for the microbiota in the in- 
duction and/or maintenance of this specific sequela of infection 
(Figure 6B). 

We next evaluated whether a transient ablation of the micro- 
biota in SPF mice could reduce MAT inflammation and restore 



mucosal immune function. Antibiotic 
treatment significantly reduced type I 
inflammation of the MAT, prevented the 
aberrant accumulation of GDI OS^CD1 1 b"^ 
DCs in the MAT, and restored their phys- 
iological homing to the MLN (Figures 6C and 6D). However, while 
MAT inflammation was reduced following antibiotic treatment, 
lymphatic integrity was not fully restored (Figure S5C). Oral 
gavage of such mice with microbial-derived products was suffi- 
cient to restore cellular infiltration into the MAT, (Figures S5D and 
S5E) suggesting that the effect may be mediated by enhanced 
microbial exposure independent of any defined microbiota. 
Indeed, mice developing lymphadenopathy can be found in the 
same cage as CL“ mice (Figure S5F). 

We next assessed if antibiotic treatment was sufficient to 
restore mucosal immunity. As previously described, antibiotic 
treatment reduced the efficacy of oral vaccination (Hall et al., 
2008) (Figures 6E and S5G). Nonetheless, a substantial number 
of Th17 cells specific for dmLT or OVA were still induced and 
accumulated in the lamina propria of vaccinated mice (Fig- 
ure 6E). Together with restoration of MAT homeostasis, antibi- 
otic treatment also restored the capacity of CL^ mice to respond 
to oral vaccination to a level comparable to naive antibiotic- 
treated mice (Figure 6E). Thus, damage induced by acute 
infections may be reversible and limiting exposure to defined 
microbial products while promoting lymphatic repair may 
represent a therapeutic strategy to restore mucosal immunity. 
Taken together, these data reveal a central role for the micro- 
biota in the maintenance of the immune defects induced by 
acute infection. 
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Figure 6. Microbiota Sustains MAT Inflammation and Immune Dysfunction Post-infection 

(A and B) Specific pathogen-free (SPF) or germ-free (GF) C57BL/6 mice were infected with Y. pseudotuberculosis. (A) Sections of MLNs isolated 4 weeks post- 
infection were stained with picrosirius red. (B) Frequency and number of DC subsets in the MLN. Shown are representative flow cytometric contour plots of DC 
subsets (gated according to the Figure S2C). Numbers in plots represent the mean frequency (±SEM) of cells within the adjacent gate. Bar graph shows the 
numbers of CD1 03^CD1 1 b^ DCs in the MLN. 

(C and D) Four weeks post-infection, SPF mice were treated with broad-spectrum antibiotics (Abx) for 3 weeks. (C) Bar graphs show the number of 
CD1 03'^CD1 1 b"^ DCs (gated as indicated in Figure S4E), neutrophils, and eosinophils isolated from the MAT of naive or infected animals with or without antibiotic 
treatment. (D) Shown are representative plots of the MLN DC subsets from naive and infected mice, with and without antibiotic treatment. Numbers represent the 
mean frequency (±SEM) of cells within the adjacent gate of all samples within this experiment. Bar graph shows the number of CD1 03'^CD1 1 b"^ DCs in the MLNs. 
(E) Top: scheme for vaccination of antibiotic-treated mice. Abx treatment was continued during the vaccination period. 7 days post-immunization, lymphocytes 
were isolated from the siLP and stimulated in vitro with dmLT or OVA pulsed DCs. Bottom left: contour plots showing IFN-y and IL-17A producing CD4'^ T cells 
after vaccination in naive (Ctrl Vax) or infected (YP Vax) mice treated with Abx. Numbers represent the percent of 004"^ T cells that express each cytokine in the 
adjacent gate. The bar graph (bottom right) shows the mean percent of IL-17A producing CD4^ T cells from Abx treated naive or infected mice. All data are 
representative of two to three experiments with three to five mice in each control group and five to seven mice in each infected group. All bar graphs show the 
mean (±SEM). *p < 0.05, **p < 0.005, ***p < 0.0005 (Student’s t test). CL^, infected mice with lymphadenopathy; CL“, infected mice without lymphadenopathy; ns, 
not significant. 

See also Figure S5. 
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DISCUSSION 

Here, we report that a single acute infection can lead to long- 
lived “immunological scarring” with profound effects on tissue 
immunity that persist after the pathogenic organism has been 
cleared. Infections have long been considered as possible trig- 
gering elements in the development of chronic inflammatory dis- 
eases and immune dysfunction, but definitive and mechanistic 
associations between defined pathogens and the breakdown 
of tissue homeostasis have remained elusive. Our work provides 
a mechanistic framework via which infection may represent 
a point of no return for tissue immunity, leading to chronic 
inflammation and immune dysfunction long after the pathogen 
has been cleared. 

In this study, we found that oral infection with 
V. pseudotuberculosis profoundly and persistently remodels 
both the MAT and MLN. The pathological effect of infection in 
this mouse model reproduces the chronic lymphadenopathy 
that is a defining characteristic of Y. enterocolitica or 
Y. pseudotuberculosis infection in humans (Asano, 2012). 
Chronic mesenteric lymphadenopathy is not limited to Yersinia 
infection, as a variety of other diseases can also cause these 
symptoms, such as HIV/AIDS, Crohn’s disease, and celiac dis- 
ease (Asano, 201 2; Estes et al., 2008; Huppert et al., 2004; Lucey 
et al., 2005). The mechanism by which Y. pseudotuberculosis 
triggers damage to the lymphatic system is not understood, 
but is possibly related to the fact that Yersinia and other patho- 
gens drive phagocytic cells to travel through these vessels 
(Bou Ghanem et al., 2012; Diehl et al., 2013; St John et al., 
2014). We also show that increased permeability of lymphatics 
can be observed in other models of Gl infection or non-infectious 
colitis. As such, the point raised by our present work is not that 
Y. pseudotuberculosis per se is responsible for the breakdown 
of tissue immunity, but that in defined settings or in genetically 
predisposed individuals, Gl infections and/or inflammation, 
may play a determinant role in the disruption of immune homeo- 
stasis by remodeling the immune and lymphatic system. 

Two cytokines, IL-1 p and TNF-a, have been shown to increase 
lymphatic permeability and both are significantly increased in the 
MAT post-infection (Aldrich and Sevick-Muraca, 2013; Cromer 
et al., 2014). Another potential contributing factor in chronic 
inflammation of the MAT is the switch from type 2 to type 1 im- 
munity. Because type 2 immunity has recently been described 
as critical to tissue healing (Cause et al., 2013), persistent 
lymphatic damage post-infection may also be the consequence 
of a loss of these responses. There are a number of interesting 
parallels between the chronic remodeling of the MAT seen after 
Y. pseudotuberculosis infection and inflammatory bowel dis- 
ease. Notably, inflammation of the MAT, altered lymph drainage 
and presence of translocating commensals in this compartment 
has been described in Crohn’s disease patients (Behr, 2010; 
Peyrin-Biroulet et al., 2012; Zulian et al., 2013). In accordance 
with these studies, we measured increased lymphatic leakage 
in the T cell transfer model of colitis. Previous studies have 
also shown that the larger, contractile lymphatics are permeable 
to soluble antigens and that adipose tissue dendritic cells can 
sample molecules traveling in the lymph (Kuan et al., 2015). 
Furthermore, when permeability of lymphatic collecting vessels 



is increased, the absorptive function of upstream lymphatic cap- 
illaries, such as those in intestinal villi, is decreased (Scallan 
et al., 2013) a phenomenon that could be relevant to our 
present observation. Therefore, exploration of the involvement 
of lymphatic damage and inflammation in the MAT could be 
important to our understanding of the pathophysiology of 
Crohn’s disease and gut inflammatory disorders in general. 

Infection abrogated the accumulation of the migratory 
CD103'^CD11b^ DCs in the MLNs, a cardinal subset of gut 
DCs involved in the promotion of both T and B cell responses, 
(Bogunovic et al., 2009; Farache et al., 2013; Satpathy et al., 
201 3; Schlitzer et al., 201 3). We also observed a partial reduction 
in the frequency of CD103'^CD1 1 b“ DCs in the MLN, a pop- 
ulation associated with Treg induction (Coombes et al., 2007; 
Sun et al., 2007). Because inflammation can complicate the 
identification of these cells (Merad et al., 2013), migratory 
CD103'^CD11b“ DCs may be more profoundly affected by the 
infection than reported here, a point supported by the substantial 
impact of the infection on Treg induction. Of interest, infection- 
induced disruption of the positioning of DCs and macrophages 
in the LN has been observed in multiple settings of infection 
and vaccination and similar defects due to lymphadenopathy 
may further affect DC function in our model (Gaya et al., 2015; 
Katakai et al., 2004; Mueller et al., 2007). 

Immunological damage from acute infection had profound 
consequences for the maintenance of tissue-specific immunity, 
including defects in the development of Thi 7 cells, Tr©g, and IgA"^ 
B cell responses in the Gl tract. Collectively, these responses 
represent the canonical mucosal immune response required 
for oral tolerance, adaptive immunity, and compartmentalization 
of the microbiota (Belkaid and Hand, 2014). Our findings indicate 
that under conditions in which the environment is in flux, some 
acute infections may predispose to food allergy and aberrant im- 
mune responses to commensals via the abrogation of Treg induc- 
tion. In low to middle income countries, oral vaccines for Rota- 
virus, Poliomyelitis, Vibrio cholerae, and Shigella often fail to 
protect with the same degree of efficacy as in high-income coun- 
tries (Levine, 2010). Several interrelated conditions, including 
persistent infections, gut inflammation, malnutrition, and envi- 
ronmental enteropathy, have been proposed to contribute to 
this important public health issue (Korpe and Petri, 2012; Levine, 
2010). Our observed defects in DC trafficking and the develop- 
ment of Thi 7 and IgA responses to model oral vaccinations 
may provide an explanation for the immune dysfunction in these 
populations. Interestingly, when individuals suffering from envi- 
ronmental enteropathy are relocated to Western countries, 
where diet and sanitation are no longer issues, it can take up 
to 2 years for intestinal absorption to be restored, implying that 
the physiological defects outlive exposure to acute infections 
(Lindenbaum et al., 1972). Based on our present work, we pro- 
pose that the integrity of the MAT and lymphatics could be 
also severely affected under these settings and that this phe- 
nomenon may contribute to immune dysfunction. 

The microbiota plays a central role in the maintenance of the 
immunological damage caused by Y. pseudotuberculosis infec- 
tion. This observation reveals yet another way by which interrup- 
tion of the host’s relationship with its microbiota can compro- 
mise tissue homeostasis. Our current model proposes that 
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such effects may not be associated with a defined microbiota. 
Notably, mice developing lymphadenopathy do not have a 
unique microbiota compared to those mice that did not develop 
lymphadenopathy and oral gavage of defined microbial products 
can phenocopy the effect of the microbiota. Thus, increased 
exposure to microbes or microbe-derived ligands provided by 
the increased leakage of the lymphatics may be sufficient to 
induce a positive feedback loop of inflammation in the MAT, an 
effect that could predispose to the development of inflammatory 
disorders. On the other hand, we cannot exclude the possibility 
that benign microbes such as lactobacilli that can be found in the 
CL^ MLN could be contextually pathogenic post-infection. 

The “hygiene hypothesis” proposes that increasing rates 
of allergy and asthma in western countries could be the conse- 
quence of reduced infectious disease, in particular helminthic 
worm infection during early childhood (Johnston et al., 2014; 
Schaub et al., 2006; Weinstock et al., 2004). More recently, 
this model has been modified to include the modern shift in the 
human microbiota (Blaser and Falkow, 2009). Together, these 
profound changes in microbial encounters and, as a direct result, 
the state of the immune system, are believed to contribute to 
the dramatic increase in chronic inflammatory and autoimmune 
disorders seen in high-income countries. Our present work 
proposes that this model also needs to integrate the change in 
the frequency and type of acute infections associated with 
high-density urban living. Modern societies are now burdened 
with endemic respiratory and Gl infections, whose long-term 
sequelae are not fully understood. These repeated and unregu- 
lated inflammatory challenges may profoundly remodel the im- 
mune system and thereby contribute to the increased burden 
of autoimmune and inflammatory disorders. 

Together, this study provides a framework to understand how 
previously encountered infections can induce a breakdown of 
tissue immune homeostasis, thereby contributing to disease 
later in life. Thus, in order to fully comprehend the etiology of 
complex diseases, it may be necessary to look beyond a pa- 
tient’s genetic susceptibilities and concurrent environmental 
stressors and examine whether immunological scarring asso- 
ciated with previous infections may have “set the stage” for 
chronic inflammation. 

EXPERIMENTAL PROCEDURES 
Mice 

C57BL/6NTac SPF mice were purchased from laconic Farms. Germ-free 
C57BL/6 mice were bred at laconic Farms and maintained in the Nationai 
Institute of Allergy and Infectious Diseases (NIAID) gnotobiotic facility. 
C57BL/6NTac-Lysozyme {Lyz2) (LysM-eGFP reporter), B6.SJL-Ptprc®/ 
BoyAiTac (CD45.1), C57BL/6-|Tg]CD11c(/fgax):EYFP (CD11c-eYFP reporter), 
B6.129P(Cg)-Ptprc^ CxscrT^^ ^''VUttJ (CX3CRI GFP reporter), and C57BL/ 
6-CD45a{Ly5a)-Rag1~^~ TCP OT-II {Rag1~^~ OT-II TCP transgenic) mice 
were obtained through the NIAID-Taconic exchange program. All mice were 
bred and maintained under pathogen-free conditions at an American Associ- 
ation for the Accreditation of Laboratory Animal Care accredited animal facility 
at the NIAID and housed in accordance with the procedures outlined in the 
Guide for the Care and Use of Laboratory Animals. Gender- and age-matched 
mice between 6-12 weeks old were used. When possible, preliminary exper- 
iments were performed to determine requirements for sample size, taking into 
account resources available and ethical, reductionist animal use. In general, 
each mouse of the different experimental groups is reported. Exclusion criteria 



such as inadequate staining or low cell yield due to technical problems were 
pre-determined. 

Oral Infection, Vaccination, and Treatments 

For infection, Y. pseudotuberculosis (strain IP32777) (Simonet and Falkow, 
1992) was grown into 2XYT media (Quality Biological) overnight at 25°C with 
vigorous shaking. Mice were fasted for 12 hr prior to infection with 1x10^ 
CFU via oral gavage. Oral vaccination with double mutant E. coll heat labile 
toxin (R192G/L21 1 A) (dmLT) (provided by J. Clements) was carried out as pre- 
viously described (Hall et al., 2008). Mice received two immunizations (7 days 
apart and response was assessed 7 days after). For broad-spectrum antibiotic 
treatment, mice were treated as previously described (Hall et al., 2008). 
Commensal-derived DNA/LPS gavage of antibiotic-treated mice was per- 
formed as previously described (Hall et al., 2008), with the modification that 
250 i^g of commensal DNA and 250 |ig of LPS (from E. coll] Sigma) were gav- 
aged every other day for a week. 

Analysis of Leaking in the MAT 

Bodipy FL Cl 6 (40 lag/mouse) (Life Technologies) mixed with milk cream (1 0%) 
(100 III) and olive oil (100 |il) was gavaged into mice and MAT tissue was 
isolated for microscopy 3 hr later and analyzed by scanning fluorescent 
microscopy (Leica M205 FA, motorized stereo microscope) and confocal 
microscopy (Leica SP8). 

Statistics 

Data are presented as mean ± SEM. Group sizes were determined based on 
the results of preliminary experiments. Mice were assigned at random to 
groups. Mouse studies were not performed in a blinded fashion. Statistical 
significance was determined with the two-tailed unpaired Student’s t test, 
under the untested assumption of normality. Within each group there was an 
estimate of variation, and the variance between groups was similar. All statis- 
tical analysis was calculated using Prism software (GraphPad). Differences 
were considered to be statistically significant when p < 0.05. 

For more detailed information, please consult the Supplemental Experi- 
mental Procedures. 
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SUMMARY 

Intestinal Th17 cells are induced and accumulate in 
response to colonization with a subgroup of intesti- 
nal microbes such as segmented filamentous bac- 
teria (SFB) and certain extracellular pathogens. 
Here, we show that adhesion of microbes to intesti- 
nal epithelial cells (ECs) is a critical cue for Th17 in- 
duction. Upon monocolonization of germ-free mice 
or rats with SFB indigenous to mice (M-SFB) or rats 
(R-SFB), M-SFB and R-SFB showed host-specific 
adhesion to small intestinal ECs, accompanied by 
host-specific induction of Th17 cells. Citrobacter ro- 
dentium and Escherichia coli 0157 triggered similar 
Th17 responses, whereas adhesion-defective mu- 
tants of these microbes failed to do so. Moreover, a 
mixture of 20 bacterial strains, which were selected 
and isolated from fecal samples of a patient with 
ulcerative colitis on the basis of their ability to cause 
a robust induction of Th17 cells in the mouse colon, 
also exhibited EC-adhesive characteristics. 

INTRODUCTION 

The gut microbiota contributes to the constitutive development 
of Th17 cells in the intestinal lamina propria (LP) (Atarashi et al., 
2008; Ivanov et al., 2008). Among commensals, segmented 



filamentous bacteria (SFB) are one of the most potent inducers 
of Th17 cells, and monocolonization of mice with SFB causes 
abundant accumulation of Th17 cells in the small intestinal (SI) 
LP (Gaboriau-Routhiau et al., 2009; Ivanov et al., 2009). 
Recent reports have shown that most of the intestinal Th17 
cells induced by SFB have T cell receptors (TCRs) that specif- 
ically recognize SFB antigens (Goto et al., 2014; Yang et al., 
2014). However, since the SFB antigens themselves do not 
dictate Th17 differentiation (Yang et al., 2014), and micro- 
biota-mediated Th17 cell development occurs independently 
of major innate immune receptors (Atarashi et al., 2008; 
Ivanov et al., 2009), SFB colonization must elicit unique 
signaling pathways in the intestine to generate a Th17-condu- 
cive environment. 

SFB are spore-forming gram-positive bacteria with a 
segmented and filamentous morphology, and tight adhesion to 
SI epithelial cells (ECs) is a remarkable characteristic feature of 
these bacteria (Davis and Savage, 1974). SFB are widely distrib- 
uted in vertebrates (Klaasen et al., 1993). In spite of the morpho- 
logical similarities of SFB isolated from various hosts, their 16S 
rRNA gene sequences differ, and several reports suggest that 
SFB have undergone host species-specific selection and adap- 
tation (Chung et al., 2012). The complete genomic sequences 
of SFB colonizing the mouse and rat intestines, referred to as 
M-SFB and R-SFB, respectively, were determined. Although 
the overall genomic organization of M-SFB and R-SFB are 
similar, 5%-10% of the genes are specific to each strain, 
and the amino acid sequence identity between orthologous 
gene pairs is on average 80% (Prakash et al., 2011). Analysis 
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of differences between M-SFB and R-SFB may be useful 
to improve understanding of the effects of SFB on the immune 
system. 

In addition to SFB colonization, infections with several extra- 
cellular pathogens such as Candida albicans and Citrobacter 
rodentium are known to induce Th17 cells (Conti and Gaffen, 
2010; Mangan et al., 2006). Th17 cells induce the recruitment 
of neutrophils and activation of ECs, leading to enhanced clear- 
ance of extracellular pathogens in concert with other immune 
cells such as IgA-secreting plasma cells and group 3 innate 
lymphoid cells (ILC3s). The induction of Th17 cells by those 
pathogens has been postulated to be mediated by the local cyto- 
kine milieu produced by intestinal ECs and specific subsets of 
myeloid cells (Weaver et al., 2013). However, it remains unclear 
which features of these particular microbes specifically elicit 
Th17 versus other types of immune cell responses at intestinal 
mucosal sites. 

Because SFB and C. rodentium commonly adhere to ECs, 
we hypothesized that adhesion-mediated activation of ECs 
plays a pivotal role in the induction of Th17 cells. Accordingly, 
we examined the ability of M-SFB, R-SFB, wild-type, and mutant 
strains of C. rodentium and enterohemorrhagic Escherichia 
coli (EHEC) 0157:H7 to adhere to ECs and induce Th17 cells. 
In addition, by combining gnotobiotic technique and anaerobic 
culturing of members of the intestinal microbiota from a patient 
with ulcerative colitis (UC), we isolated 20 strains based on their 
ability to induce Th17 cells in mice and examined EC-adhesive 
characteristics of these 20 Th17-inducing human strains. Our 
findings indicate that adhesion to ECs is a common mechanism 
used by intestinal microbes to activate host Th17 responses. 

RESULTS 

Host-Specific Adhesion to SI ECs and Th17 Induction 
by SFB 

C57BL/6 (B6) or IQI germ-free (GF) mice were orally inoculated 
with R-SFB or M-SFB, and their intestinal colonization was 
monitored by qPCR analysis. The concentration of fecal and SI 
luminal R-SFB DMA quickly increased and reached a plateau 
within 1 week; the kinetics and levels were comparable to those 
of M-SFB (Figures 1A and S1A). Consistent with the qPCR re- 
sults, Gram-stained smears of cecal luminal contents contained 
equivalent numbers of R-SFB and M-SFB with indistinguishable 
morphology (Figure S1 B), indicating that R-SFB and M-SFB both 
colonize and grow robustly within the mouse intestinal lumen. In 
contrast, when SI mucosa-associated SFB DMA amounts were 
examined, we detected much lower levels of R-SFB than of 
M-SFB (Figure 1 A). We also performed scanning electron micro- 
scopy (SEM) of washed SI mucosa to visualize EC-adhering 
SFB. Numerous M-SFB were observed adhering tightly to the 



mouse SI epithelium. In contrast, we did not detect any R-SFB 
adhering to SI ECs (Figures IB and S1C). R-SFB also failed to 
adhere to SI ECs in monocolonized Rag1~'~ mice or IL- 
2R^~'~Rag2~'~ mice (Figures S1D and S1E), making it unlikely 
that the impaired adhesion of R-SFB to mouse SI ECs was due 
to a R-SFB-specific reaction by the mouse immune system. 
Although we detected some EC-adhering M-SFB and R-SFB 
in the mouse colon, these were less frequent than in the SI 
(Figure 1 B). 

We then investigated the effects of monocolonization of mice 
with M-SFB or R-SFB on the induction of Th17 cells. In agree- 
ment with previous reports (Gaboriau-Routhiau et al., 2009; 
Ivanov et al., 2009), a robust accumulation of Th17 cells was 
observed in the SI LP of B6 or IQI mice monocolonized with 
M-SFB. In contrast, R-SFB colonization had no significant effect 
on the number of Th1 7 cells (Figures 1 C and 1 D). Notably, both 
M-SFB and R-SFB induced the accumulation of Th17 cells in 
the colonic LP, although less efficiently than that in the SI LP 
following M-SFB colonization (Figure 1D), correlating with the 
EC adhesion efficacy of the SFB. 

We next performed the reciprocal experiment by monocolo- 
nizing GF F344 rats with R-SFB or M-SFB. After oral inoculation 
of GF rats with R-SFB, high levels of R-SFB DNA were detected 
in feces and SI mucosal tissues by qPCR (Figure 1E). Examina- 
tion of the rat SI mucosal surface by SEM revealed the frequent 
presence of R-SFB adhering tightly to the follicular-associated 
epithelium (FAE) and villous ECs (Figure 1 F). In contrast, far fewer 
M-SFB associated with the rat SI ECs, although the amount of 
M-SFB and R-SFB DNA in the feces was comparable (Figures 
IE and 1F). We then investigated the effects of R-SFB and 
M-SFB monocolonization on Th17 cells in rats. The frequency 
of Th17 cells in the SI LP was markedly increased after monoco- 
lonization of GF rats with R-SFB, whereas M-SFB-monocoloni- 
zation induced only a slight increase in the frequency of these 
cells (Figures 1G and 1H). Taken together, adhesion of SFB to 
intestinal ECs is a host-specific event and strongly correlates 
with Th17 cell induction. 

Induction of Antigen-Specific Th17 Cells by 
EC-Adhesive SFB 

To examine whether EC adhesion is required for priming SFB 
antigen-specific Th17 cell differentiation, we generated mice 
co-colonized with M-SFB and R-SFB and assessed the antigen 
specificity of the SI LP Th17 cells. In the feces of co-colonized 
mice, similar amounts of M-SFB and R-SFB DNA were detected 
(Figure S2A). SI LP cells, which include T cells and antigen-pre- 
senting cells (APCs), were isolated from the co-colonized mice, 
and stimulated ex vivo either with PMA and ionomycin (P/I) or 
with autoclaved cecal content from GF, M-SFB-, or R-SFB- 
monocolonized mice. Cytokine expression was analyzed as a 



Figure 1. EC Adhesion and Th17 Induction by SFB in Mice and Rats 

B6 or IQI mice (A-D) and F344 rats (E-H) were monocolonized with M-SFB or R-SFB for 3 weeks. 

(A and E) qPCR analysis for SFB DNA in feces, SI luminal contents, and mucosal tissue specimens. 

(B and F) SEM images of epithelial surfaces in the SI and colon. 

(C, D, G, and H) Th17 cell frequencies in SI and colon LP. Representative dot plots gated on total lymphocytes (C and G) and summarized data gated on CD4'^ 
T cells (D and H) are shown. 

Error bars represent SD. See also Figure SI . 
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Figure 2. Effects of SFB Adhesion on Anti- 
gen-Specific Th17 Cells, IgA^ Cells, and 
ILC3S 

(A and B) Cytokine responses of SI LP CD4'^ T cells 
from M-SFB and R-SFB co-colonized B6 mice to 
the indicated ex vivo stimulation. A/C, autoclaved 
cecal contents; P/I, PMA/ionomycin. 

(C) Fecal IgA levels of indicated monocolonized B6 
mice and F344 rats (n = 5). 

(D) The frequencies of ILC3s among CDGO"^ cells 
(left dot plots) and IL22^ cells among ILC3s (right 
histograms and bar graphs) in SI LP cells of the 
indicated mice. 

Error bars represent SD. See also Figure S2. 
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readout for TCR activation. Although less efficient than P/I, the 
M-SFB antigens evoked IL-1 7 expression in a significant number 
of cells (Figures 2A and 2B). In contrast, the R-SFB antigens eli- 
cited no response. Therefore, a substantial fraction of Th17 cells 
that accumulated in the SI LP of co-colonized mice had TCRs 
specific to the EC-adhering M-SFB, but not to the non-adhering 
R-SFB. These results suggest that antigens from EC-adhering 
SFB are taken up and presented by APCs much more efficiently 
than those of non-adhering SFB. 

Recently, two major protein antigens responsible for M-SFB- 
mediated SI LP Th17 cell induction were identified (SFBNYU_ 
003340 and SFBNYU_004990) (Yang et al., 2014). We analyzed 
the R-SFB genome and identified the gene encoding the orthol- 
ogous protein corresponding to SFBNYU_003340, but could 
not identify an SFBNYU_004990 ortholog. The minimal epitope 
within M-SFB SFBNYU_003340 was mapped as QFSGAVPNK 
(Yang et al., 2014), and the corresponding epitope region in 
R-SFB was found to be QFNGQVPNN. The M-SFB epitope 
peptide efficiently induced a Th17 response when added to the 
ex vivo cultures of SI LP cells isolated from specific pathogen- 
free (SPF) mice (Figure S2B). In contrast, the same SI LP cells 
had no reactivity to the R-SFB peptide QFNGQVPNN. Therefore, 
peptides serving as dominant antigens for SI LP Th1 7 cell induc- 
tion in mice are specifically expressed by M-SFB. 



Effects of EC-Adhering SFB on IgA*^ 
and ILC3 Cells 

SFB colonization has been shown to 
enhance IgA production (Talham et al., 
1999; Umesaki et al., 1999). Consistent 
with these reports, monocolonization of 
B6 mice with M-SFB induced a signifi- 
cant increase in the levels of fecal IgA 
and frequencies of lgA'"CD138^ plasma 
cells in the SI LP (Figures 2C and S2C). 
In contrast, R-SFB monocolonization in 
mice resulted in a weak IgA response 
(Figures 2C and S2C). We also examined 
IgA production in SFB monocolonized 
rats. Fecal IgA levels were substantially 
increased after monocolonization of GF 
rats with R-SFB, whereas M-SFB-mono- 
colonization induced only a modest in- 
crease (Figure 2C). These findings sug- 
gest that luminal colonization is insufficient for the full induction 
of IgA"^ cells, but instead an interaction between SFB and ECs 
is required, which is similar to Th17 cells. 

Next, we examined the effects of SFB colonization on ILC3s, 
which share many features with Th17 cells (Spits et al., 2013). 
Thy1^CD3'RORyt^ cells corresponding to ILC3s were much 
more abundant in the SI LP of SPF B6 mice than in the spleen 
(Figure S2D), and ~70% of these cells became positive for 
IL-22 after ex vivo stimulation with IL-23 (Figure 2D). Although 
the abundance of SI LP ILC3s in GF versus SPF mice did not 
differ significantly, the frequency of IL-22-proficient (IL-22'^) 
cells among ILC3s was dramatically decreased in GF mice 
(Figure 2D). Monocolonization of mice with M-SFB resulted in 
a significant increase in the frequency of IL-22'^ SI ILC3s, 
although the level did not reach that observed in SPF mice (Fig- 
ure 2D). The increase in IL-22'^ cells upon monocolonization 
with M-SFB was observed for both NKp46^ and NKp46“ 
ILC3 populations (Figure S2E). Interestingly, R-SFB induced a 
significant increase in the frequency of IL-22'^ cells with a 
magnitude similar to that of M-SFB (Figure 2D). The induction 
of IL-22'^ ILC3s was also observed in mice inoculated with 
feces from Jackson Laboratory (JAX) SPF mice, which are 
known to be devoid of SFB (Ivanov et al., 2009) (Figure 2D), 
indicating that IL-22^ ILC3s can be induced by non-adhering 
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SFB and even by non-SFB commensals, which is in contrast to 
Th17 and IgA-" cells. 

SI EC Activation by Adhering SFB 

We next examined the influence of SFB adhesion on SI EC gene 
expression profiles by RNA sequencing (RNA-seq). The expres- 
sion of several genes was highly upregulated in SI ECs of 
M-SFB-colonized mice compared to GF and R-SFB-colonized 
mice (Figures 3A and S3A). The upregulated genes included 
three isoforms of serum amyloid A (SAA), regenerating islet- 
derived protein 3 p (Reg3P) and Reg3y and nitric oxide synthase 
2 (Nos2). The expression differences of selected genes were 
confirmed by qPCR analysis (Figures 3B and S3B). Elevated 
levels of SAA1 protein in SI ECs of M-SFB-monocolonized 
mice were clearly visualized by immunofluorescence micro- 
scopy (Figure 3C). Recombinant SAA1 markedly enhanced 
the in vitro differentiation of naive CD4‘^ T cells into Th17 cells 
mediated by CD11c^ cells, IL-6, and transforming growth 
factor p (TGF-p) (Figure 3D). This SAA1 -mediated enhancement 
was severely attenuated in the absence of CD1 1 c"^ cells or of IL-6 
and TGF-p, or in the presence of an anti-IL-1 receptor 1 (IL-1 R1) 
blocking antibody (Figures 3D and 3E). In fact, SAA1 elevated the 
expression level of IL-1p mRNA in splenic CDUc"^ cells (Fig- 
ure 3F). Therefore, the current data extend previous studies re- 
porting that SAAs are induced in SI ECs by colonization with 
SFB (Ivanov et al., 2009) and suggest that SAA induction requires 
SFB adhesion to ECs. Furthermore, the SAAs produced by this 
mechanism likely condition neighboring CDUc"^ myeloid cells 
to produce IL-1 p and probably other factors that act with IL-6 
and TGF-p to enhance Th17 cell differentiation. 

In addition to the increase in SAAs, mRNA levels of the reactive 
oxygen species (ROS)-generating enzyme dual oxidase 2 
(Duox2) and of its maturation factor Duoxa2 were also highly up- 
regulated in SI ECs in M-SFB-monocolonized mice (Figures 3A 
and 3B). Treatment of M-SFB-monocolonized mice with a ROS 
scavenger, N-acetyl-L-cysteine (NAC), in the drinking water 
significantly limited Th17 cell induction (Figure 3G) without 
affecting adhesion of M-SFB to SI ECs (Figure S3C). Therefore, 
EC-adhering SFB affect Th17 cell accumulation via integration 
of a number of different molecular mechanisms including pro- 
duction of SAAs and ROS by ECs. 

Toward an understanding of the molecular basis of adherent 
SFB-mediated SAA induction, we explored the transcriptional 
regulatory elements of the Saa1 gene locus. We identified two 
DNase I hypersensitive sites downstream of the Saa1 gene, 
which were uniquely active in the SI and distinguished by acety- 
lated histone 3 lysine 27 (H3K27ac) active enhancer marks 
(Shen et al., 2012) (Figure 3FI). Investigation of publicly available 
datasets of chromatin immunoprecipitation sequencing (ChlP- 
seq) for transcription factor (TF) binding (Camp et al., 201 4) sug- 
gested interactions of CCAAT-enhancer-binding protein (C/EBP) 
with both regulatory regions of Saa 7 (Figure 3H). Our SI EC RNA- 
seq data and qPCR analysis showed that C/EBP6 was specif- 
ically upregulated in SI ECs of M-SFB-monocolonized mice 
(Figures 3I and S3B), suggesting that it may be a key TF regu- 
lating SAA1 expression. In line with this finding, we observed 
co-induction of SAA1 and C/EBP5 expression in a mouse SI 
EC line (aMos7) co-cultured with M-SFB in vitro (Figure 3J). 



Induction of SAA1 and C/EBP6 expression was observed only 
when ECs were in direct contact with M-SFB (Figure 3J); under 
these conditions M-SFB adhesion to the aMoS7 cells was 
confirmed by fluorescence in situ hybridization (FISH) and SEM 
(Figures S3D and S3E). The induction of SAA1 and C/EBP6 
expression by M-SFB was further enhanced by the addition 
of F-actin inhibitory drugs, latrunculin A, or swinholide A (Fig- 
ure S3F). Therefore, actin reorganization in ECs induced by 
SFB adhesion may lead to the elevation of C/EBP6 expression, 
which then contributes to SAA1 expression. 

Influence of Mouse Genetic Background on 
SFB-Mediated Th17 Cell Induction 

All of the above experiments were performed using B6 or IQI 
mice. We next examined the influence of mouse genetic back- 
ground on intestinal Th17 cell induction. The frequency of Th17 
cells in SPF C57BL/10 and B10.D2 mice was similar to that 
in B6 and IQI (ICR) mice (Figure 4A). The only exception was 
BALB/c mice, which had a much lower frequency of Th17 
cells even under SPF conditions or after monocolonization with 
M-SFB (Figures 4A, 4B, and S4A). It is noteworthy that BALB/c 
mice had substantial numbers of RORyt^ single-positive cells 
but had severe reduction of RORyf^lL-17^ double-positive cells 
(Figure S4A). M-SFB, but not R-SFB, were found to adhere to SI 
EC surfaces in BALB/c mice, similar to the results in B6 mice 
(Figures 4C and S4B) and splenic naive CD4'^ T cells from 
BALB/c mice gave rise to a normal frequency of RORyf^lL-17'^ 
cells when differentiated under Th17 conditions (Figure S4C), 
suggesting that SFB adhesion to SI ECs occurs normally and 
that CD4'^ T cells do not have an intrinsic differentiation defect, 
but instead that EC activation and/or subsequent LP cell 
signaling required for RORyf^lL-17'^ Th17 induction may be 
defective in BALB/c mice. 

Among Th1 7-conducting cytokines, microbiota-induced 
constitutive and high-level expression of IL-1p was observed in 
SI LP CD1 Ic"^ cells of B6 mice (Figure 4D), which is in line with 
previous reports (Mortha et al., 2014; Shaw et al., 2012). IL-1p 
expression was significantly reduced in M-SFB-monocolonized 
and SPF BALB/c mice, compared with that of B6 mice (Fig- 
ure 4E). Besides the reduced expression of IL-1p in CDUc"^ 
cells, expression of SAA1/2 was significantly decreased in SI 
ECs of BALB/c mice (Figures 4F and S4D). Injection of exoge- 
nous IL-1 p into SPF BALB/c mice elevated expression of SAAs 
in SI ECs (Figure 4G) and this was accompanied by an increase 
in SI LP RORyf^lL-1 7^ Th1 7 cells (Figures 4J and S4E), suggest- 
ing that insufficient IL-1p production by LP CD11c^ cells is 
responsible, at least in part, for the reduction in SAA expression 
and Th1 7 cell accumulation in BALB/c mice. Consistent with this 
interpretation, B6 Il1r1~'~ mice phenocopied BALB/c mice, dis- 
playing decreased expression of SAAs in SI ECs (Figure 4H) and 
a significant reduction in the number of SI LP RORyf^lL-1 Th1 7 
cells (Figures 4K and S4F). It is noteworthy that IL-1 p administra- 
tion induced Th17 cell accumulation in M-SFB-colonized mice, 
but not in R-SFB-colonized or GF BALB/c mice, revealing a 
requirement for the presence of EC-adhering bacteria for the 
IL-1P effect (Figures 4J and S4E). 

BALB/c mice had reduced frequencies of IL-22'^ cells 
among ILC3s compared with B6 mice under both SPF and 
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Figure 3. Adhesion-Mediated EC Activation 

(A) Heatmap showing the relative abundance for gene transcripts upregulated in SI ECs of M-SFB-mono versus GF and R-SFB-mono IQI mice. Each column 
represents a single mouse. 

(B) qPCR for the selected genes relative to Gapdh in SI ECs from the indicated mice. 

(legend continued on next page) 
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Figure 4. Influences of Mouse Genetic 
Background 

(A and B) The percentages of Th17, IL-22^ ILC3, 
and CDISS^IgA"^ cells in SI LP of the indicated 
mice. 

(C) SEM images of SI epithelial surface of the 
indicated BALB/c mice. 

(D and E) IL-1(3 protein production (D) and mRNA 
expression (E) from SI LP CDUc"^ and CD11c“ 
cells from the indicated mice. 

(F-l) Saa1l2 mRNA expression in SI ECs from the 
indicated mice. 

(J and K) The percentage of RORyt^lL-17^ cells 
among SI LP CD4^T cells in IL-1 |3-injected BALB/c 
mice (J) or Il1r1~'~ B6 mice (K). 

Error bars represent SD. See also Figure S4. 



Sion of SAAs by SI ECs (Figure 4I), sug- 
gesting that IL-22 from ILCSs, together 
with IL-1p from CD11c^ cells, act on SI 
ECs to potentiate their expression of 
SAA. Vil-Cre crossed with Il22ra1^'^ mice 
showed a tendency toward reduction of 
RORyf^lL-1 7-^ Th1 7 cells in the SI LP (Fig- 
ure S4H). Therefore, adhesive SFB may 
create a complex circuitry of interactions 
between ECs, DCs and ILCSs mediated 
by SAA, IL-1 p, IL-22, ROS, and probably 
other factors, to amplify and maintain 
signaling for constitutive accumulation of 
Th17cells (Figure S4I). 
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M-SFB-monocolonized conditions, whereas the frequency of 
IgA'^ cells was normal (Figures 4B and S4G). An EC-specific 
deficiency of the IL-22 receptor a1 gene (Vil-Cre crossed with 
Il22ra1^'^) on an SPF B6 background resulted in reduced expres- 



Extracellular Pathogens Induce 
Identical Th17 Responses through 
EC Adhesion 

To further test the link between bacterial 
EC adhesion and Th17 induction, we 
examined Citrobacter rodentium, an at- 
taching/effacing enteropathogen known 
to induce Th17 cells in mice (Mangan 
et al., 2006). B6 GF mice were mono- 
colonized with wild-type or a mutant 
C. rodentium strain lacking the eae gene, which encodes Intimin, 
a protein essential for EC adhesion (Kamada et al., 2012). The 
Aeae and wild-type C. rodentium expanded equivalently in 
the intestinal lumen, as demonstrated by recovery of similar 
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(C) Immunostaining of Sis from the corresponding mice for SAA1 (red) and DAPI (blue). Scale bar, 100 |am. 

(D and E) The percentage of RORyf^lL-l 7"^ cells after culture of splenic naive CD4 T cells in the presence of various combinations of splenic CD1 1 cells, IL-6, 
TGF-|3, SAA1, and/or anti-IL-1R1 antibody. 

(F) 111 b expression in splenic CD1 Ic"^ cells from B6 SPF mice stimulated with recombinant SAA1 . 

(G) SI LP Th17 cell frequencies of IQI M-SFB-mono mice either untreated (-) or treated with NAG. 

(H) RNA-seq and DNase-seq data from SI ECs of GF and conventional (CV) mice at the region surrounding the Saal locus. H3K27ac ChIP-seq data for SI and 
various other tissues obtained from (Shen et al., 201 2). ChIP-seq datasets for TFs assayed in various cell and tissue types. SI EC regulatory regions are highlighted 
in gray. 

(I) The relative abundance for TF genes that are >2-fold different between M-SFB-mono mice versus GF or R-SFB-mono mice. 

(J) SAA1 and C/EBP6 genes expression in aMoS7 cells cultured with cecal content from GF mice or a Percoll-enriched M-SFB or R-SFB either in contact or 
separated by a Transwell membrane (TW). 

Error bars represent SD. See also Figure S3. 
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Figure 5. EC Adhesion-Mediated Th17 Induction by C. rodentium 

IQI mice were monocolonized with wild-type (WT) or Aeae mutant of C. rodentium for 5 days. 

(A) DNA of C. rodentium in colonic luminal contents and mucosal tissues. 

(B) SEM images of colonic villi of the indicated mice. 

(C) C. rodentium localization visualized by O antigen antisera (green) and DAPI (blue) staining. Scale bar, 100 |im. 

(D) The percentage of RORyt^lL-17^ colonic CD4^ T cells. 

(E) Fecal IgA levels of the indicated mice (n = 4). 

(F) The relative abundance of gene transcripts that were commonly upregulated in M-SFB-mono mice versus R-SFB-mono mice and WT C. rodentium-mono 
mice versus Aeae C. rodentium-mono mice. 

(G) Duox2 and Duoxa2 mRNA expression in colonic ECs of the indicated mice. 

(H) Th17 cell frequencies of WT C. /'oc/enf/um -mono IQI mice either untreated (-) or treated with NAG. 

Error bars represent SD. See also Figure S5. 



amounts of bacterial DNA from the colon luminal contents of 
both groups of mice after 5 days of monocolonization (Figure 5A). 
SEM and immunofluorescence microscopy revealed that the 



wild-type strain had a strong tendency to adhere to colonic 
ECs, whereas the Aeae strain did not adhere to ECs and the 
inner mucus layer of the colon remained intact (Figures 5B and 
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5C). These results were further confirmed by qPCR quantifica- 
tion of mucosa-associated C. rodentium (Figure 5A). Coloniza- 
tion with wild-type C. rodentium resulted in potent induction of 
Th17 cells in the colon, whereas Aeae C. rodentium elicited a 
far weaker Th17 response (Figure 5D). Colonization with wild- 
type C. rodentium induced a greater increase in fecal IgA 
levels than the mutant strain (Figure 5E). Therefore, similarly to 
commensal SFB, pathogenic C. rodentium induces Th17 and 
IgA responses, most likely through adhesion to ECs. Of note, 
during the early phase (5 days after infection), when the epithelial 
layer was relatively intact, infection with wild-type C. rodentium 
caused a more conspicuous increase in Th17 cells than IFN- 
y-producing Th1 cells. However, in the late phase (14 days after 
infection), when the epithelial layer was severely disrupted and 
bacterial invasion had occurred, there was a massive increase 
in Th1 cells rather than Th17 cells (Figures S5A and S5B), sup- 
porting the requirement of intact ECs for Th17 induction in 
response to C. rodentium, without which the same microbe 
would elicit a different type of immune response. We also exam- 
ined IL-22'^ ILC3 induction by C. rodentium colonization. The 
Aeae strain induced a robust increase in the frequency of colonic 
IL-22'^ ILC3 cells comparable to the response observed in mice 
colonized with the wild-type strain (Figure S5C), indicating that 
immune stimulatory activity is not completely abrogated in the 
Aeae strain, and the induction of IL-22'^ ILC3 cells occurs inde- 
pendently of bacterial EC-adhesion. 

Next, we performed gene expression analysis by RNA-seq of 
colon ECs from mice monocolonized with the Aeae or wild-type 
C. rodentium strain. In comparison to the Aeae strain, the wild- 
type strain specifically induced the expression of genes, such 
as Duox2, Duoxa2, Reglllp, Regllly, and Nos2, which overlap 
with genes induced in SI ECs by colonization with EC-adherent 
SFB (Figures 5F, 5G, S5D, and S5E), although the expression 
of SAAs in the colon was not significantly enhanced by 
C. rodentium monocolonization (Figure S5D). Treatment of 
wild-type C. rodentium-monoco\on\ze6 mice with NAC in the 
drinking water significantly limited Th17 cell induction (Fig- 
ure 5H), without affecting adhesion of C. rodentium to ECs 
(Figure S5F). Therefore, EC adhesion and subsequent ROS 
production is a potential mechanism underlying C. rodentium- 
mediated Th17 cell induction. 

We next examined EHEC 0157:H7, another EC-adherent 
bacterium that can cause hemolytic uremic syndrome in humans 
and is a global public health concern (Croxen and Finlay, 2010). 
Because the wild-type 0157 strain was lethal in GF mice shortly 
after the monocolonization (data not shown), we used an 0157 
strain deficient in both Shiga toxin 1 and 2 genes (Asfx7 Asfx2) (Yo- 
koyama et al., 2001). This double mutant strain expanded in the 
intestine upon monocolonization without killing the mice and 
adhered to colonic EC surfaces (Figures 6A and 6B). The adhesion 
was accompanied by induction of colonic Th1 7 cells (Figure 6C). 
We subsequently constructed a Asfx7Asfx2Aeae triple mutant 
strain, which expanded comparably to the Asfx7Asfx2 double 
mutant strain in the intestinal lumen but did not adhere to colonic 
ECs (Figures 6A and 6B), and this was accompanied by loss of the 
ability to induce Th1 7 cells (Figure 6C). 

Candida aibicans is also an EC-adhesive microbe (Figure S6) 
and was found to increase the frequency of Th17 cells in the 



colon LP of monocolonized mice (Figure 6D). In contrast, 
monocolonization of mice with Listeria monocytogenes mainly 
induced Th1 cells, rather than Th17 cells (Figure 6E). The mice 
colonized with L. monocytogenes displayed disrupted structure 
of colon epithelial surface, whereas adhering bacteria were 
rarely observed (Figure 6F). 

Th17-lnducing Human Microbiota 

Finally, we attempted to identify members of the human gut mi- 
crobiota that could exert immunological effects equivalent to 
those of SFB. To this end, we collected human fecal samples 
from patients with DC (UC4-2 and UC5-1) and healthy donors 
(H1 1 , H1 7, H23) and inoculated them into GF IQI mice. All groups 
of mice exhibited a significant increase in Th17 cells in the 
colon (Figure 7A). The Th17 cell induction by H23 microbiota 
was suppressed by treatment with ampicillin (Amp) or other an- 
tibiotics (Figure 7B). In contrast, the UC5-1 microbiota-mediated 
Th17 cell induction was further enhanced when the mice were 
given Amp in the drinking water, although reduced by metronida- 
zole (MNZ) or vancomycin (VCM) treatment (Figure 7B). There- 
fore, different subsets of bacterial species were likely to be 
responsible for the induction of Th17 cells in mice inoculated 
with H23 versus UC5-1 samples. Cecal microbiota analysis by 
16S rRNA gene sequencing revealed several operational taxo- 
nomic units (OTUs) that were enriched in mice inoculated with 
UC5-1 sample treated with Amp and correlated with the number 
of colonic Th1 7 cells (Figure 7C, marked in red). To isolate Th1 7- 
inducing bacterial species, we anaerobically cultured cecal con- 
tents from Amp-treated UC5-1 mice using various culture media 
and picked 192 colonies. BLAST searches of 16S rRNA gene 
sequences revealed that the isolates contained 20 strains that 
broadly covered the bacterial species colonizing Amp-treated 
UC5-1 mice (Figure 7C, marked in green). Their genomes were 
sequenced and a phylogenetic comparison was carried out 
using 19 ribosomal protein genes predicted from the assembled 
draft genome of each strain. This analysis revealed that the 20 
strains included diverse species, such as Ciostridium, Bifidobac- 
terium, Ruminococcus, and Bacteroides (Figures 7C and S7A; 
Table S1). 

To investigate whether the isolated 20 strains have the ability 
to induce Th17 cells, we introduced a mixture of 20 strains into 
GF IQI mice. In mice inoculated with the 20 strains, we observed 
a robust accumulation of Th17 cells in the colon (Figures 7D and 
7E). The 20-strain mix was also effective in Th1 7 induction in rats 
(Figure 7F). Similarly to SFB, the 20 strains enhanced expression 
of Nos2 and Duoxa2 in ECs and IgA and ILC3 responses (Fig- 
ure 7G), whereas no significant effect was observed forThI cells 
in the colon (Figures 7D and 7E). To examine whether these 
20 Th17-inducing strains contained EC-adherent bacteria, we 
performed SEM and FISH. As a control, we examined mice 
colonized with 17 strains of human-derived Ciostridia that 
have the capacity to induce regulatory T cells (Tregs) (Atarashi 
et al., 2013). The 17 Treg-inducing strains of Ciostridia were 
not adhering and disappeared after washing. In contrast, we 
observed numerous bacteria adhering to the colonic ECs in 
mice and rats colonized with the 20 Th17-inducing strains (Fig- 
ures 7H and 7I), which is in line with our hypothesis that EC adhe- 
sion is an important signal forTh17 cell induction. 
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Figure 6. EC Adhesion-Mediated Th17 
Induction by Extracellular Pathogens 

(A) SEM images of colonic villi from IQI GF 
mice monocolonized with Asfx7Asfx2 or 
Asfx7Asfx2Aeae EHEC 0157:H7 for 3 weeks. 

(B) O antigen antisera (green) and DAPI (blue) 
staining for EHEC 0157 visualization. Scale bar, 
50 lam. 

(C-E) Th17 and Th1 cell proportions in the 
colonic LP CD4^ T cells from IQI mice inoculated 
with EHEC 0157:H7 (C), C. albicans (D), or 
L. nnonocytogenes (E). 

(F) SEM images of the colon of IQI GF mice and 
mice infected orally with L. nnonocytogenes. 

Error bars represent SD. See also Figure S6. 
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DISCUSSION 

In the present study, we demonstrate that SFB, C. rodentium 
and EHEC 0157 promote the induction of intestinal Th17 cells 
through adhesion to intestinal ECs. Moreover, we isolated 20 
bacterial strains from human intestine that can induce Th17 



cells in the mouse and rat intestine, 
and these 20 strains displayed similar 
EC adhesive characteristics. Therefore, 
the intestinal immune system mounts 
Th17 cell responses through recogni- 
tion of a physical interaction with the 
microbes, rather than by recognition 
of released microbial components or 
soluble metabolites. EC adhesion by 
microbes was also associated with in- 
creases in intestinal IgA'^ cells. Consis- 
tent with our observation, it has been 
shown that potentially colitogenic bacte- 
ria that can invade the inner mucus layer 
and colonize near the epithelium induce 
high-affinity IgA responses and become 
highly coated with IgA (Palm et al., 
2014). In contrast to Th17 cells and 
IgA^ cells, IL-22"^ ILCSs can be gener- 
ated by nonadhering commensals. 
Therefore, Th17 and IgA"^ cells control 
EC-adhering microbes, whereas IL-22'^ 
ILCSs may play a more general role in 
regulation of gut microbiota. The vast 
majority of gut microbiota are physically 
separated by the mucus layer (Johans- 
son et al., 2008; Vaishnava et al., 
2011), whereas microbes that cross the 
mucus barrier and attach to the epithe- 
lial lining are in many cases pathogenic 
(SFB and the 20 human strains are 
exceptional in that they can cross the 
mucus layer and attach to ECs, but do 
not penetrate the EC layer). Therefore, 
it seems reasonable to propose that 
the host immune system has evolved 
to discriminate the biogeographical distributions of intestinal 
microbes. 

Several reports have demonstrated that adhering SFB induce 
actin reorganization in SI ECs (Davis and Savage, 1974; Jepson 
et al., 1993). In this context, morphologically similar cytoplasmic 
alterations in intestinal ECs have been described in infections 
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with EHEC 0157 and C. rodentium (Croxen and Finlay, 2010). It 
is noteworthy that F-actin inhibition resulted in potentiated 
C/EBP6 and SAA1 expression in aMoS7 cells cocultured with 
M-SFB (Figure S3F). It is interesting to note that SFB have puta- 
tive ADP-ribosyl transferases that have sequence similarities to 
the Clostridium perfringens iota toxin, which ribosylates G-actin 
and thereby inhibits its polymerization (Pamp et al., 2012). 
Furthermore, because ROS suppresses the activity of certain 
Rho GTPase family members (Stanley et al., 2014), ROS pro- 
duced by the Duox2/Duoxa2 system in ECs may, in addition or 
instead, affect actin reorganization. In any case, there may be 
a link between actin cytoskeletal modulation by adherent SFB 
and subsequent gene induction in ECs, possibly through mecha- 
nosensing mechanisms (Dupont et al., 2011). 

Since recombinant SAA1 activated CD1 1 c"^ cells to upregulate 
IL-1p expression (Figure 3F), and recombinant IL-1p activated 
ECs to potentiate expression of SAAs (Figure 4G), our results 
suggest a model in which the initial increase in SAAs in SI ECs 
caused by SFB adhesion is then augmented via an amplification 
loop of SAA1 and IL-1p between SI ECs and CDUc"^ myeloid 
cells (Figure S4I). This amplification processes may be required 
for the maintenance of high level expression of SAAs in SI ECs 
and IL-1 p in myeloid cells, which causes constitutive accumula- 
tion of Th17 cells. In an accompanying article published in this 
issue of Cell, Sano et al. (2015) show that SAAs can directly 
act on RORyf^ T cells to induce IL-1 7 expression. Derebe et al. 
(2014) showed that SAAs act as a transporter of retinol. There- 
fore, SAAs may exert their effects through multiple mechanisms. 
IL-22 derived from ILC3s also contributes to this loop via poten- 
tiation of SAA expression by SI ECs. IL-22 alone cannot suffi- 
ciently activate this amplification loop, as was shown that 
IL-22'^ ILC3s induced by luminal R-SFB cannot induce Th1 7 cells 
(Figure 2D). 

A substantial fraction of Th1 7 cells specifically recognized EC- 
adhesive M-SFB but not non-adhering R-SFB (Figures 2A 
and 2B). The simplest explanation for this observation is that an- 
tigens may be taken up by host cells only when SFB adhere to 
ECs. This process likely involves bacterial sampling by dendritic 
cells (DCs). In this context, recognition and phagocytosis of 
C. roc/enf/um- infected apoptotic ECs by LP DCs were shown 
to preferentially trigger Th17 cell differentiation (Torchinsky 
et al., 2009). SFB and other EC-adhesive microbes may be en- 
gulfed by LP DCs together with their adhering apoptotic ECs, 
processed and loaded onto MFIC class II molecules. SFB are 
innocuous members of the microbiota; however, SFB can be 
pathogenic, depending on the host genotype. For example, 
SFB have been implicated in T cell-mediated autoimmune dis- 
eases such as arthritis (Wu et al., 2010). In this context, we 
observed a striking genotype-specific difference upon monoco- 
lonization with EC-adhesive SFB in the induction of Th17 cells, 
with BALB/c mice having fewerTh17 cells in comparison to B6 
and IQI mice. Our observation in BALB/c mice provides one 
remarkable example showing that the combination of genetic 
background and composition of the gut microbiota affects 
T cell status. Genotype-specific differences in the differentiation 
of T cells in response to a component of the gut microbiota 
may explain the phenotypic variations in autoimmune disease 
susceptibility. 



Different from the host-specific Th1 7 induction by SFB, the 20 
bacterial strains isolated from a human fecal sample were Th1 7- 
inducing in at least two different mammalian hosts (mice and 
rats. Figures 7E and 7F); therefore, they may function in humans 
and may act as pathobionts, because they were derived from 
UC patients. However, we did not observe any inflammatory 
changes in the intestines of mice inoculated with the 20 strains 
(data not shown). Furthermore, mining of publicly available 
data from the MetaHIT project (Qin et al., 2010) revealed that 
only three out of the 20 strains were significantly increased in 
the microbiomes of UC and Crohn’s disease subjects (Fig- 
ure S7B). Moreover, the 20 strains were not abundant in mice 
inoculated with a fecal sample from subject UC4-2, in which 
Th17 cells were induced similarly to UC5-1 microbiota-associ- 
ated mice (Figure S7C). Therefore, the 20 strains may not be 
essential components for Th17 induction nor contribute to the 
pathogenesis of IBD in humans. Our study design has some lim- 
itations in this context, including the limited number of subjects 
tested and the fact that robust activity in mice may not be phys- 
iologically relevant in humans and vice versa. Nevertheless, this 
is a clear demonstration of Th17 cell induction by human-asso- 
ciated bacterial species and therefore represents a significant 
step forward in dissecting human intestinal microbiota biology. 
The intestinal Th17 responses to commensal and pathogenic 
bacteria with shared EC-adhering characteristics have obvious 
clinical implications in the context of IBD, vaccine design, and 
probiotics. 

EXPERIMENTAL PROCEDURES 

Materials and experimental procedures are detailed in the Supplemental 
Experimental Procedures. 

Animals and Microbes 

GF mice and rats were purchased from CLEA and Sankyo Labe Service and 
kept in the GF facility of Yakult Central Institute or RIKEN Yokohama Institute. 
To generate SFB-monocolonized animals, cecal or fecal suspensions from 
SFB-monocolonized animals were orally inoculated into GF animals. The 
C. rodentium Aeae and E. coli 0157:H7 Asfx7Asfx2 mutant strains have 
been described previously (Kamada et al., 2012; Nagano et al., 2003). To 
generate the E. coli 0157:1-17 Asfx7Asfx2Aeae mutant strain, the eae gene 
was replaced with the tetracycline-resistance gene by homologous recombi- 
nation. Approximately 1x10® colony-forming unit (CFU) of C. rodentium, 
E. coli 0157, or L monocytogenes was orally administered into GF mice. 
C. albicans (TUA6) was provided by Dr. Takashi Umeyama (National Institute 
of Infectious Diseases, Japan) and orally administered (-^1 x 10® CFU) into 
GF mice. 

qPCR and 16S rRNA Gene Pyrosequencing 

To quantify bacterial load, bacterial DNA was isolated from intestinal contents 
or mucosal tissues using a QIAamp Stool Mini Kit, and qPCR was carried out 
using universal or bacterial strain-specific primers for 16S rRNA genes. For 
16S rRNA gene pyrosequencing, cecal contents were incubated with lyso- 
zyme and achromopeptidase. Sodium dodecyl sulfate and subsequently pro- 
teinase K was added to the suspension and high-molecular-mass DNA was 
purified by phenol/chloroform extraction. PCR was performed using primers 
to the VI -V2 region of the 16S rRNA gene, and the amplified DNA was used 
as template for 454 GS Junior sequencing. 

Isolation of Human-Derived Th17-lnducing Bacterial Strains 

Stool samples from patients with UC and from healthy donors were obtained 
and inoculated into IQI GF mice. The cecal contents from Amp-treated UC5-1 
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ID Similarity Closest genome-sequenced reference strain 
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81.71 
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1E3 


99.93 


1A9 


99.85 


2G11 


99.89 


2E1 


83.30 


1F7 


99.38 


1D2 


90.18 



Coprobacillus sp. D6 contI .1 
Bifidobacterium animalis subsp. lactis ATCC 27673 
Clostridium sp. ASF356 
Clostridium cf. saccharolyticum K1 0 
Eubacterium sp. 3_1_31 contl.1 
Erysipelotrichaceae bacterium 21_3 
Firmicutes bacterium ASF500 
Ruminococcus bromii L2-63 
Ruminococcus gnavus AGR21 54 
Subdoligranulum sp. 4_3_54A2FAA 
Firmicutes bacterium ASF500 
Clostridium hathewayi 12489931 
Bifidobacterium breve UCC2003 
Ruminococcus obeum A2-1 62 
Coprobacillus sp. D6 contI .1 
Clostridium symbiosum WAL-14673 
Bacteroides dorei 5_1_36/D4 supercont2.3 
Butyrate-producing bacterium SSC/2 
Clostridium bolteae 90B3 
Clostridium bolteae 90B3 



■ OTUs positively correlated with the number of Th17 cells 

■ OTUs negatively correlated with the number of Th17 cells 
H OTUs corresponding to the isolated 20 strains 





(legend on next page) 
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mice were cultured on Schaedler, BHI, GAM, CM0151, BL, or TS agar plates 
under strictly anaerobic conditions. Individual colonies were picked and iden- 
tified by sequencing of the 16S rRNA gene fragment. 

Statistical Analysis 

All statistical analyses were performed using Prism software with two-tailed 
unpaired Student’s t test or one-way ANOVA followed by Tukey’s post hoc 
test, p values < 0.05 were considered significant (*p < 0.05, **p < 0.01 , and 

***p< 0.001). 

ACCESSION NUMBERS 

The accession number for the RNA sequencing data reported in this paper 
is NCBI GEO: GSE71734. The draft genome sequences of Th17-inducing 
20 strains can be retrieved under DDBJ BioProject ID: PRJDB 4119-4138 
and the 16S rRNA gene sequencing data under DDBJ BioProject ID: 
PRJDB4113. 

SUPPLEMENTAL INFORMATION 

Supplemental Information includes Supplemental Experimental Procedures, 
seven figures, and one table and can be found with this article online at 
http://dx.d 0 i. 0 rg/l 0. 1 01 6/j.cell.201 5.08.058. 
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Figure 7. Th17 Induction and EC Adhesion by 20 Bacterial Strains Derived from the Human Intestine 

(A) Th17 cell frequencies in IQI GF mice colonized with stool from patients with UC or healthy adults. 

(B) Th17 cell frequencies in UC5-1 or H23-colonized IQI mice either untreated (cont) or treated with the indicated antibiotics. 

(C) Cecal microbiota compositions of each mouse (n = 4 or 5 per group). OTUs positively correlated with the frequency of Th1 7 cells are marked in red and those 
negatively correlated marked in blue. OTUs corresponding to the isolated 20 strains are marked in green. ID, isolated strain ID. See also Table SI. 

(D-F) The percentage of Th17 and Th1 cells in the colonic LP of B6 mice (D and E) and F344 rats (F) colonized with the mixture of 20 strains (20st.). 

(G) qPCR for Nos2 and Duoxa2 in colonic EC (upper panels) and FACS for LP IgA^ cells and IL-22^ ILC3s (lower panels) of B6 mice colonized with the 20 strains. 
(H and I) SEM images and FISH staining with EUB338 of the proximal colon of mice and rats colonized with the 20 Th17-inducing strains versus the 17 Treg- 
inducing strains. Scale bar, 100 i^m. 

Error bars represent SD. See also Figure S7. 



Cell 163, 1-14, October 8, 2015 ©2015 Elsevier Inc. 13 




Please cite this article in press as: Atarashi et al., Th1 7 Cell Induction by Adhesion of Microbes to Intestinal Epithelial Cells, Cell (201 5), http:// 
dx.doi.org/1 0. 1 01 6/j.cell.201 5.08.058 



Cell 



Johansson, M.E., Phillipson, M., Petersson, J., Velcich, A., Holm, L., and Hans- 
son, G.C. (2008). The inner of the two Muc2 mucin-dependent mucus iayers in 
colon is devoid of bacteria. Proc. Natl. Acad. Sci. USA 105 , 15064-15069. 

Kamada, N., Kim, Y.G., Sham, H.P., Vaiiance, B.A., Puente, J.L, Martens, 

E. C., and Nunez, G. (2012). Regulated virulence controls the ability of a path- 
ogen to compete with the gut microbiota. Science 336 , 1325-1329. 

Kiaasen, H.L., Koopman, J.P., Van den Brink, M.E., Bakker, M.H., Poelma, 

F. G., and Beynen, A.C. (1993). Intestinai, segmented, fiiamentous bacteria in 
a wide range of vertebrate species. Lab. Anim. 27, 141-150. 

Mangan, P.R., Harrington, L.E., O’Quinn, D.B., Helms, W.S., Bullard, D.C., El- 
son, C.O., Hatton, R.D., Wahl, S.M., Schoeb, T.R., and Weaver, C.T. (2006). 
Transforming growth factor-beta induces development of the T(H)17 lineage. 
Nature 441 , 231-234. 

Mortha, A., Chudnovskiy, A., Hashimoto, D., Bogunovic, M., Spencer, S.P., 
Beikaid, Y., and Merad, M. (2014). Microbiota-dependent crosstalk between 
macrophages and ILC3 promotes intestinal homeostasis. Science 343 , 
1249288. 

Nagano, K., Taguchi, K., Hara, T., Yokoyama, S., Kawada, K., and Mori, H. 
(2003). Adhesion and coionization of enterohemorrhagic Escherichia coii 
0157:H7 in cecum of mice. Microbioi. Immunol. 47 , 125-132. 

Palm, N.W., de Zoete, M.R., Cullen, T.W., Barry, N.A., Stefanowski, J., Hao, L., 
Degnan, P.H., Hu, J., Peter, I., Zhang, W., et al. (2014). Immunoglobulin A 
coating identifies colitogenic bacteria in inflammatory bowel disease. Cell 
158 , 1000-1010. 

Pamp, S.J., Harrington, E.D., Quake, S.R., Reiman, D.A., and Biainey, P.C. 
(2012). Singie-ceii sequencing provides ciues about the host interactions of 
segmented fiiamentous bacteria (SFB). Genome Res. 22, 1107-1119. 

Prakash, T., Qshima, K., Morita, H., Fukuda, S., imaoka. A., Kumar, N., 
Sharma, V.K., Kim, S.W., Takahashi, M., Saitou, N., et ai. (2011). Compiete 
genome sequences of rat and mouse segmented fiiamentous bacteria, a 
potent inducer of th17 ceii differentiation. Ceii Host Microbe 10 , 273-284. 

Qin, J., Li, R., Raes, J., Arumugam, M., Burgdorf, K.S., Manichanh, C., Nieisen, 
T., Pons, N., Levenez, F., Yamada, T., etai.; MetaHiT Consortium (2010). A hu- 
man gut microbiai gene catalogue established by metagenomic sequencing. 
Nature 464 , 59-65. 

Sano, T., Huang, W., Haii, J.A., Yang, Y., Chen, A., Gavzy, S.J., Lee, J., Ziei, J., 
Miraidi, E.R., Domingos, A.I., et ai. (2015). An 23R/iL-22 circuit reguiates 
epitheiiai serum amyioid A to promote iocai effector Th17 responses. Ceii 
163 . Pubiished oniine September 24, 2015. http://dx.doi.0rg/IO.IOI6/j.ceil. 
2015.08.061. 



Shaw, M.H., Kamada, N., Kim, Y.G., and Nunez, G. (2012). Microbiota-induced 
iL-1 p, but not iL-6, is criticai for the deveiopment of steady-state TH1 7 ceiis in 
the intestine. J. Exp. Med. 209 , 251-258. 

Shen, Y., Yue, F., McCieary, D.F., Ye, Z., Edsail, L, Kuan, S., Wagner, U., 
Dixon, J., Lee, L., Lobanenkov, V.V., and Ren, B. (2012). A map of the cis-reg- 
uiatory sequences in the mouse genome. Nature 488 , 116-120. 

Spits, H., Artis, D., Coionna, M., Diefenbach, A., Di Santo, J.P., Eberi, G., 
Koyasu, S., Locksiey, R.M., McKenzie, A.N., Mebius, R.E., etai. (2013). Innate 
lymphoid celis-a proposai for uniform nomenclature. Nat. Rev. Immunol. 13 , 
145-149. 

Stanley, A., Thompson, K., Hynes, A., Brakebusch, C., and Quondamatteo, F. 
(2014). NADPH oxidase complex-derived reactive oxygen species, the actin 
cytoskeieton, and Rho GTPases in ceii migration. Antioxid. Redox Signal. 
20 , 2026-2042. 

Talham, G.L, Jiang, H.Q., Bos, N.A., and Cebra, J.J. (1999). Segmented 
fiiamentous bacteria are potent stimuli of a physioiogicaliy normai state of 
the murine gut mucosai immune system. Infect. Immun. 67 , 1992-2000. 
Torchinsky, M.B., Garaude, J., Martin, A.P., and Blander, J.M. (2009). Innate 
immune recognition of infected apoptotic ceiis directs T(H)17 ceii differentia- 
tion. Nature 458 , 78-82. 

Umesaki, Y., Setoyama, H., Matsumoto, S., Imaoka, A., and Itoh, K. (1999). 
Differential roles of segmented filamentous bacteria and Clostridia in develop- 
ment of the intestinal immune system, infect, immun. 67 , 3504-3511. 
Vaishnava, S., Yamamoto, M., Severson, K.M., Ruhn, K.A., Yu, X., Keren, Q., 
Ley, R., Wakeiand, E.K., and Hooper, L.V. (2011). The antibacterial lectin 
Regiiigamma promotes the spatiai segregation of microbiota and host in the 
intestine. Science 334 , 255-258. 

Weaver, C.T., Eison, C.Q., Fouser, LA., and Koiis, J.K. (2013). The Th17 
pathway and inflammatory diseases of the intestines, iungs, and skin. Annu. 
Rev. Pathoi. 8 , 477-512. 

Wu, H.J., ivanov, i.l., Darce, J., Hattori, K., Shima, T., Umesaki, Y., Littman, 
D.R., Benoist, C., and Mathis, D. (2010). Gut-residing segmented fiiamentous 
bacteria drive autoimmune arthritis via T heiper 17 ceiis. Immunity 32 , 
815-827. 

Yang, Y., Torchinsky, M.B., Gobert, M., Xiong, H., Xu, M., Linehan, J.L., 
Aionzo, F., Ng, C., Chen, A., Lin, X., et ai. (2014). Focused specificity of intes- 
tinai TH1 7 ceiis towards commensai bacteriai antigens. Nature 570, 1 52-1 56. 
Yokoyama, S., Suzuki, T., Shiraishi, S., Qhishi, N., Yagi, K., Ichihara, S., itoh, 
S., and Mori, H. (2001). Construction of deletion mutants of Shiga (-Like) toxin 
genes (stx-1 and/or stx-2) on enterohemorrhagic Escherichia coii (Q157: H7). 
J. Ciin. Biochem. Nutr. 30 , 33-42. 



14 Cell 163, 1-14, Octobers, 2015 ©2015 Elsevier Inc. 




Article 



Cel 

An IL-23R/IL-22 Circuit Reguiates Epitheiiai Serum 
Amyioid A to Promote Locai Effector Th17 
Responses 



Graphical Abstract 



Day 3-5 after SFB gavage 
Duodenum 



Day 7-14 after SFB gavage 
Duodenum 




ILC3 



>L-23^ / ^^SAA1/2 



Naive 
Th cell 





Mesenteric 
Lymph node 



ILC3 



Colon 



' MigrationT?"'/ M^Rvt+ 

into -tI/t 

Kill M ^ Th17 

»^MLN 

1st 5tep 

(RORyt expression)' 



O ^ 

IL-17A- RORyr Th17 



* *JsAa1/2 



IL-17A^ RORyr Th17 
2"d step; 

(IL17A production) 



Colon 




<% 

ILC3 



O ^ @ ® 

IL-17A-RORYMh17 



Authors 

Teruyuki Sano, Wendy Huang, 

Jason A. Hall, ..., Ana I. Domingos, 
Richard Bonneau, Dan R. Littman 

Correspondence 

dan.littman@med.nyu.edu 

In Brief 

Cytokines produced by Th17 ceiis are 
critical for protection and repair of 
mucosal barriers. IL-17A production is 
orchestrated by locaiization of specific 
poised Th17 ceiis at sites where the gut 
microbiome drives epitheiiai secretion of 
serum amyloid A. 



Highlights Accession Numbers 

• SFB-specific Th1 7 cells are primed in MLN and traffic GSE70599 

throughout the Gl tract 

• IL-17A-producing Th17 ceils are enriched in iieum, where 
SFB adheres to epitheiium 

• SAA1/2 from ileal epithelial cells contributes the induction of 
IL-17A in Th17 ceiis 



• SFB-induced IL-22 production by ILC3 stimulates epithelial 
cells to make SAA1 /2 



Sano et al., 2015, Cell 163, 1-13 
October 8, 201 5 ©201 5 Elsevier Inc. 
http://dx.d 0 i. 0 rg/l 0.1 01 6/j.cell.201 5.08.061 



CelPress 



Please cite this article in press as: Sane et al., An IL-23R/IL-22 Circuit Regulates Epithelial Serum Amyloid A to Promote Local Effector Th17 
Responses, Cell (2015), http://dx.doi.Org/10.1016/j.cell.2015.08.061 



Article 



Cell 



An IL-23R/IL-22 Circuit Reguiates 

Epitheiiai Serum Amyioid A 

to Promote Locai Effector Th17 Responses 

Teruyuki Sano,^’^ Wendy Huang, Jason A. Hall,^ Yi Yang,^’^ Alessandra Chen ,^’2 Samuel J. Gavzy,^ June-Yong Lee,^ 
Joshua W. Ziel,^ Emily R. Miraldij^-^ ^.s | Domingos,® Richard Bonneau,®’4,5 and Dan R. Littman ^’2 * 

■'Molecular Pathogenesis Program, The Kimmel Center for Biology and Medicine of the Skirball Institute, New York University School of 
Medicine, New York, NY 10016, USA 

^The Howard Hughes Medical Institute, New York University School of Medicine, New York, NY 10016, USA 

^Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY 10003, USA 

^Courant Institute of Mathematical Sciences, Computer Science Department, New York University, New York, NY 10003, USA 

^Simons Center for Data Analysis, Simons Foundation, New York, NY 10010, USA 

^Obesity Laboratory, Institute Gulbenkian de Ciencia, Oeiras 2780-156, Portugal 

^Co-first author 

^Present address: Department of Microbiology and Immunology, Medical University of South Carolina, 173 Ashley Avenue, Charleston, 
SC 29425, USA 

*Correspondence: dan.littman@med.nyu.edu 
http://dx.d 0 i. 0 rg/l 0.1 01 6/j.cell.201 5.08.061 



SUMMARY 

RORyf^ Th17 cells are important for mucosal de- 
fenses but also contribute to autoimmune disease. 
They accumulate in the intestine in response to mi- 
crobiota and produce IL-17 cytokines. Segmented 
filamentous bacteria (SFB) are Th17-inducing com- 
mensals that potentiate autoimmunity in mice. 
RORyf^ T cells were induced in mesenteric lymph 
nodes early after SFB colonization and distributed 
across different segments of the gastrointestinal 
tract. However, robust IL-17A production was 
restricted to the ileum, where SFB makes direct con- 
tact with the epithelium and induces serum amyloid A 
proteins 1 and 2 (SAA1/2), which promote local IL- 
17A expression in RORyf^ T cells. We identified an 
SFB-dependent role of type 3 innate lymphoid cells 
(ILC3), which secreted IL-22 that induced epithelial 
SAA production in a Stat3-dependent manner. This 
highlights the critical role of tissue microenvironment 
in activating effector functions of committed Th17 
cells, which may have important implications for 
how these cells contribute to inflammatory disease. 

INTRODUCTION 

The vertebrate gastrointestinal (Gl) tract is colonized by hun- 
dreds of distinct species of microorganisms that collectively 
maintain a mutualistic relationship with the host (Maepherson 
and Harris, 2004). This mutualism is critically dependent on a 
state of balanced immune activation, which fosters cohabitation 
between the host and microbiota, while providing optimal pro- 
tection against opportunistic pathogens (Honda and Littman, 
2012). It is now appreciated that the composition of the micro- 



biome can contribute significantly to this immunological balance, 
in part through the capacity of individual bacterial or viral species 
to profoundly influence distinct arms of the immune response 
by themselves or in concert with other microbial species (Hooper 
et al., 2012; Virgin, 2014). For instance, Bacteroides fragilis 
and mixtures of various strains of Clostridia, which colonize 
the large intestine, have been shown to promote intestinal and 
systemic immune tolerance through regulatory T cells (Atarashi 
et al., 2013; Lathrop et al., 2011; Round et al., 2011), and 
segmented filamentous bacteria (SFB) induce antigen-specific 
T-helper-17 cells (Th17) cell differentiation in the small intestine 
(Gaboriau-Routhiau et al., 2009; Ivanov et al., 2009; Yang 
et al., 2014). 

Th17 cells constitute a subset of activated CD4'^ T cells that 
are characterized by the production of the cytokines interleukin 
(IL)-17A, IL-17F, and IL-22 (Korn et al., 2009). These cytokines 
act on a broad range of hematopoietic and non-hematopoietic 
cells to regulate central aspects of host immunity, including 
granulopoiesis, neutrophil recruitment, and the induction of anti- 
microbial peptides (Weaver et al., 2007). Th1 7 cells thus execute 
important functions in both epithelial homeostasis and host de- 
fense against various extracellular pathogens, such as Candida 
albicans and Pseudomonas aeruginosa (Stockinger and Veld- 
hoen, 2007). Conversely, over-exuberant Th17 responses may 
promote auto-inflammatory diseases, such as Crohn’s disease, 
rheumatoid arthritis (RA), psoriasis, and multiple sclerosis (MS) 
(Furuzawa-Carballeda et al., 2007). While genetic polymor- 
phisms significantly factor into the onset of these diseases, 
emerging evidence also highlights the influence that environ- 
mental factors, such as diet and microbial composition, can 
impose on such propensities. Accordingly, recent studies have 
illustrated the potential of SFB to exacerbate Th17-mediated 
disease in murine autoimmune models of both RA and MS 
(Lee et al., 2011; Wu et al., 2010), although the intermediate 
molecular steps connecting SFB to a distal immune response 
are ill defined. 
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Figure 1. Site of SFB Colonization Correlates with Maximal IL-17A Induction in RORyf^ T Cells 

(A) Quantitative PCR (qPCR) anaiysis of SFB genomic 16S in duodenum, iieum, and coion (proximai and distai segments of each) from SFB-coionized (SFB®; 
gavaged with SFB-rich fecai contents) or SFB-free (JAX; gavaged with SFB-free fecai contents) mice at 7 and 14 days post-gavage. Copy number of SFB 16S 
was normaiized by copy number of mouse genomic ll-23r gene. The experiment was repeated more than three times with simiiar resuits combined. Data are 
represented as mean with SD. 

(B) Number of RORyf^ CD4^ T ceiis in intestinai iamina propria (LP) or MLN of SFB-coionized (SFB®; n = 7) or SFB-free (JAX; n = 7) mice on 7 and 14 days post- 
gavage. CD4'^ T ceiis were isoiated from the indicated tissues, and Foxp3“ RORyC CD4'^ T ceiis were quantified by fluorescence-activated cell sorting (FACS). 

(legend continued on next page) 
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SFB colonization of the small intestine promoted global tran- 
scriptional changes in host epithelia, including the induction 
of antimicrobial peptides and stress response genes, such as 
serum amyloid A (SAA1 and SAA2) (Ivanov et al., 2009). SAA is 
typically induced in response to infection and acute injury and 
can promote inflammation, in part through elicitation of proin- 
flammatory cytokine production and recruitment of granulo- 
cytes, monocytes, and T lymphocytes (Uhlar and Whitehead, 
1999). The effect of SAA on the immune response is context 
driven (Cray et al., 2009; Eckhardt et al., 2010; Ivanov et al., 
2009), much like that of Th1 7 cells. Insofar as SFB and Th1 7 cells 
are intertwined, the question of whether SAA impacts aspects of 
Th17 biology remains to be addressed. 

Th17 cells along with several other innate-like cell lineages, 
including specific subsets of y6 T cells (yhM) and type 3 innate 
lymphoid cells (ILC3), are regulated by the transcription factor 
RAR-related orphan receptor gamma (RORyt) (Chien et al., 
2013; Ivanov et al., 2006; Spits and Di Santo, 2011). However, 
in contrast to the requirement for antigen recognition in the 
context of major histocompatibility complex (MHC) to drive 
Th1 7 cell activation, y61 7 and ILC3 effector functions are elicited 
independently of antigen presentation. For example, the pro- 
inflammatory cytokine IL-23 triggers rapid IL-17 and IL-22 
secretion by y617 cells and ILC3s, respectively, upon ligation 
of the highly expressed IL-23 receptor (IL-23R). As y617 cells 
and ILC3s often reside in proximity to exposed mucosal sur- 
faces, their activation typically precedes antigen-specific T cell 
differentiation and recruitment (Martin et al., 2009; Sutton 
et al., 2009). Whether this has bearing on the function of newly 
recruited T cells is unclear. 

We utilized models of acute SFB colonization to investigate 
the mechanism of Th17 cell induction in a spatiotemporal 
context. We found that, following early induction of SFB-specific 
RORyf^ Th17 cells in the mesenteric lymph nodes, there was 
distribution of such cells throughout the length of the gut, from 
duodenum to colon, but IL-17A expression was largely confined 
to the terminal ileum, the site of SFB attachment to epithelium 
(Ivanov et al., 2009). We have identified an SFB-triggered circuit 
in which ILC3 secretion of IL-22 is critical for local epithelial 
production of SAA1 and SAA2, which act directly on poised 
Th17 cells to amplify effector cytokine production. These find- 
ings suggest that tissue microenvironments contribute to the 
acquisition of effector functions by polarized activated effector 
and memory cells. 

RESULTS 

Selective IL-17A induction in RORyt^ T Cells in Ileum of 
SFB-Colonized Mice 

SFB colonization results in a striking increase in both the number 
and proportion of Th1 7 cells among total CD4^ T cells within the 



small intestine lamina propria (SILP) (Ivanov et al., 2009). To 
explore the mechanism of Th17 cell induction, we orally intro- 
duced fecal contents from Rag2~'~ mice, which 

have about 100-fold enriched fecal SFB (SFB®) compared to 
wild-type (WT) animals (Yang et al., 2014), into SFB-free (JAX) 
mice and subsequently traced the kinetics of Th17 cell differen- 
tiation in different compartments of the Gl tract on days 7 and 1 4. 
As expected, SFB colonization was most abundant in the termi- 
nal ileum (Figures 1A and S1A). After 5 days of SFB® gavage, 
we observed a marked increase in proportion and number of 
CD4'^ T cells expressing RORyt, which defines the Th17 cell 
program, in the mesenteric lymph nodes (MLN) that drain the in- 
testine (Figure S1 B). Adoptively transferred carboxyfluorescein 
succinimidyl ester (CFSE)-labeled SFB-specific T cells from 
7B8 T cell receptor transgenic mice (Yang et al., 2014) prolifer- 
ated and upregulated RORyt expression in the MLN by day 4 
after SFB colonization (Figure S1C). Expansion of these cells 
was then observed throughout the SILP by 7 or 14 days post- 
SFB gavage (Figures IB and S1C). Using fluorescently labeled 
MHC class II tetramers with a bound immunodominant SFB 
antigen, we found SFB-specific RORyf^ cells throughout the 
gut; however, they were enriched in the ileum, possibly reflecting 
additional expansion in response to local antigen (Figure S1 D). 

To track expression of IL-17A, we colonized re- 

porter mice with SFB®. Strikingly, co-staining of RORyt and 
GFP indicated that the majority of RORyt^ cells expressed IL- 
17A mRNA only in the terminal ileum, but not in other regions 
of the Gl tract (Figures 1 0 and 1 D). Induction of IL-1 7A in poised 
Th17 cells thus appears to occur primarily at the site of SFB 
colonization. 

Serum Amyloid A Proteins Modulate Th17 Cell Cytokine 
Production 

Recent studies have shown that the tissue microenvironment 
confers distinct transcriptional and functional properties to 
otherwise similar lymphoid and myeloid cells (Burzyn et al., 
2013; Cipolletta et al., 2012; Lavin et al., 2014; Lewis et al., 
2011). Our observation of the spatial restriction of IL-17A- 
producing Th17 cells raised the possibility that SFB induces 
changes in the microenvironment of the underlying lamina 
propria, thus contributing to the differentiation state of poised 
Th17 cells. Consistent with this possibility, germ-free (GF) mice 
monocolonized with SFB upregulated expression of several 
secreted molecules in the terminal ileum, among which SAA1 
and SAA2 (referred to as SAA1/2, as they are highly homologous) 
were the most strongly induced (Ivanov et al., 2009). Because 
SFB attaches to the epithelium, we focused on secreted mole- 
cules that changed expression within ileal epithelial cells for 
their potential influence on the functional status of adjacent 
Th17 cells. To identify epithelium-specific translated RNAs, 
we employed in vivo translating ribosome affinity purification 



(C) IL-17A (GFP^) production in Foxp3“ RORyt^ CD4'^ T ceiis from intestinai LPs and MLN in iL-17A reporter (//77a^^'°^^^'°) mice at 14 days post-SFB coionization. 
iL-17A-producing Th17 ceiis were monitored by FACS after staining with anti-GFP antibody without re-stimuiation. 

(D) The proportion of IL-1 7A producing Th1 7 cells among Foxp3“ RORyf^ CD4^ T cells following SFB colonization. Each dot represents a single mouse, and the 
orange bar indicates average of relative gene expression of each group. Data were from at least two independent experiments (JAX; n = 7. SFB®; n = 9). 

N.S., not significant (p > 0.05); *p < 0.05; **p < 0.01 ; ***p < 0.001 ; and ****p < 0.0001 . See also Figure SI . 
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Figure 2. Role of Epithelial SAA1/2 in Ileal Th17 Cell Production of IL-17A 

(A) Translating ribosome affinity purification (TRAP) assay for identification of the cell type expressing SAA1/2. Four individual lines of TRAP mice, conditionally 
expressing a GFP tagged ribosome L10 fusion protein in intestinal epithelial (Villin-Cre), T (CD4-Cre), myeloid (CD11c-Cre), or liver (Albumin-Cre) cells were 
generated. After 2 weeks of SFB colonization, ribosomes in small intestinal lysates collected from each line were enriched by GFP pull-down. Actively translating 
SAA1/2 mRNAs were quantified by qRT-PCR. Black and white bar indicates TRAP signal from SFB-free animals and light blue, dark blue, purple, and orange bars 
indicate TRAP signal from SFB-positive animals. White bar indicates TRAP signal from tissue harvested from JAX animals at 6 hr following intraperitoneal 
administration of LPS. 

(B) Quantification of SAA1/2 mRNAin intestinal epithelial cells of duodenum, ileum, and colon from SFB® (day 7; n = 8, day 14; n = 3), pure SFB (n = 4), JAX (n = 3), 
or PBS-gavaged (n = 8) mice. Data were normalized using Gapdh. Similar experiments were performed three times. 

(legend continued on next page) 
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(TRAP), which was developed to identify cell type-specific tran- 
scripts in tissues containing heterogeneous cell populations 
(Heiman et al., 2014). TRAP Villin-Cre, TRAP CD11c-Cre, TRAP 
CD4-Cre, and TRAP Albumin-Cre mice expressing a condition- 
ally induced GFP ribosome L10 fusion protein from the rpHO 
locus in epithelial cells, dendritic cells, T cells, and hepatocytes, 
respectively, were colonized with SFB for 2 weeks or left 
untreated, after which GFP pull-down of tissue lysates and 
qRT-PCR using transcript-specific primers were performed to 
identify translating mRNAs in specific cell types. Using primers 
that amplify both SAA1 and SAA2 mRNAs, we found that both 
were highly transcribed and translated in epithelial cells from 
SFB^ mice (Figure 2A), consistent with previous reports of SAA 
protein expression in small and large intestinal epithelial cells 
(Derebe et al., 2014; Eckhardt et al., 2010). We next assessed 
the spatiotemporal induction of SAA1/2 mRNAs in the epithelial 
fraction of the intestine upon acute colonization with SFB (Fig- 
ure 2B). Strikingly, SAA1/2 mRNA induction was confined to 
the terminal ileum and was modulated in synchrony with SFB 
colonization (Figures 1A and S1A). Differential SAA expression 
was confirmed by immunoblotting of the EDTA-epithelial cell- 
rich fractions from different regions of the Gl tract of animals 
with or without SFB colonization (Figure 2C). Notably, this 
localization corresponds to the site of SFB attachment and the 
location of IL-17A-producing Th17 cells. 

We next asked whether SAA1/2 contribute to functional 
differentiation of Th17 cells in vivo. Wild-type (WT) or SAA1 
and SAA2 (DKO) mutant mice (Eckhardt et al., 2010) were 
crossed with reporter mice in an SFB-free back- 

ground. Seven days after SFB® gavage, WT and DKO litter- 
mates were colonized at similar levels (Figure S2A). On day 7, 
at the peak of the Th17 response, we observed in WT and 
DKO animals comparable proportions of FoxpS"^®^ RORyf" 
004"^ T cells in different segments of the Gl tract. Within the ter- 
minal ileum, however, production of IL-17A-GFP in the poised 
Th17 cells was significantly reduced in DKO mice, both propor- 
tionally and on a per-cell basis (Figures 2D and 2E). In addition, 
IL-17A and IL-17F mRNAs in the SILP of DKO animals were 
significantly reduced relative to WT littermates (Figure S2A). 
Unlike IL-17A and IL-17F, the chemokine receptor CCR6, 
a marker of Th17 cells, was expressed in similar amounts in 
wild-type and SAA1/2DKO animals (Figure S2B), consistent 
with SAA1/2 contributing to a specific subset of genes in the 
Th17 cell program. 

To exclude a possible role for SAA1/2 in T cell development or 
repertoire bias, we colonized WT and DKO hosts with SFB® feces 
and adoptively transferred into them naive CD4^ T cells from 
7B8 transgenic mice (Yang et al., 201 4). One week post-transfer, 
we noted that comparable proportions of donor-derived cells 



became RORyt^ within the terminal ileum of both DKO and WT 
animals. However, production of IL-1 7A by the RORyf^ 7B8 cells 
upon PMA/lonomycin restimulation was markedly attenuated in 
DKO recipients (Figure 2F). Similar results were observed in WT 
and DKO recipients of T cells from 7B8 transgenic mice bred 
onto the IL17A-IRES-GFP reporter background (Figure S2C). 
Although almost all SFB-specific donor-derived T cells found in 
the ileum of SFB-colonized mice expressed RORyt regardless 
of the host genotype, we observed a reduction in total number 
of these cells recovered in SAA1/2 DKO animals (Figure 2F). 
This suggests that, in addition to a role in promoting Th17 cyto- 
kine production, SAA may promote local Th17 cell proliferation 
and/or retention. 

Next, we treated murine naive CD4^ T cells cultured in Th17 
polarizing conditions with recombinant mouse SAA1 (rmSAAl). 
SAA1 significantly enhanced IL-17A and IL17-F production in 
RORyf^ CD4^ T cells cultured in suboptimal Th17 polarizing 
conditions (Figures 3A-3C and S3A). Importantly, this effect 
was reversed when anti-mSAA blocking antibody was intro- 
duced (Figure S3A). In contrast, CCR6 expression was not 
changed by rmSAAl treatment (Figure S3B). Based on our 
in vivo observation, we also asked whether mSAA1 exerts a 
direct effect on cell proliferation in addition to Th17 signature 
cytokine production. Labeling of naive CD4'^ T cells with CFSE 
and intracellular Ki67 staining showed enhanced proliferation of 
RORyt^ Th17 cells upon treatment with rmSAAl (Figures 3A- 
3C and S3C), but the higher proportions of cells expressing 
IL-17A and IL-17F were observed independently of the extent 
of proliferation (Figure 3C). Furthermore, in vitro differentiation 
of naive CD4'^ T cells toward Th1, Th2, and iTregs was not 
significantly affected in the presence of recombinant mouse 
SAA1 (Figure S3D). Similarly, rhSAA augmented IL-17A produc- 
tion during human Th17 cell priming (Figure S3E). Importantly, 
this effect was reversed when SAA blocking antibody was 
introduced into culture. These results suggest that SAA acts 
directly on both mouse and human Th17 cells to enhance their 
differentiation and effector function. 

To further study how SAA influences the transcriptional 
program of Th17 cells, we performed RNA sequencing (RNA- 
seq) with control and rmSAAl -treated murine Th17 cells. We 
identified a specific subset of the Th17 gene expression pro- 
gram that was specifically affected by rmSAAl treatment 
(Figures 3D and S3F). Unbiased Ingenuity Pathway Analysis 
predicted RORyt, the key regulator of the Th17 differentiation 
program, as the top upstream regulatory factor sensitive to 
SAA perturbation in vitro (Figure 3E). ROR-element-driven 
luciferase assays in Th17 cells confirmed that RORyt tran- 
scriptional activity was potentiated during rmSAAl stimulation 
in vitro (Figure 3F). 



(C) Representative western blots of lysates from the EDTA-epithelial-rich fraction of different regions of the Gl tract in animals with or without SFB colonization. 
(D-F) Role of SAAs in intestinal LP Th1 7 cell differentiation following SFB colonization. (D and E) IL-1 7A reporter littermates in WT (black, n = 5) or Saa1l2 double- 
knockout (DKO) (gray, n = 5) backgrounds were colonized with SFB for 7 days. RORyf^ cells among CD4'^ LP T cells and their IL-17A producing capacity were 
measured by FACS. MFI and cell numbers of CD4^RORYt^Foxp3“GFP^ Th1 7 cells were quantified. (F) Naive CD4^ T cells from SFB-specific TOR transgenic (7B8) 
mice were adoptively transferred into SFB-colonized WT (circles, n = 4) orSaa1l2 DKO (squares, n = 4) littermates (as indicated in diagram). Total number of donor 
cells recovered from ileum (far left), percentage of RORyt-expressing donor-derived cells (center left), percentage of IL-17A-expressing cells upon ex vivo 
re-stimulation by PMA/lonomycin (center right), and their total number (far right) were monitored by FACS at 7 days post-transfer. 

N.S, not significant (p > 0.05); *p < 0.05; **p < 0.01 ; ***p < 0.001 ; and ****p < 0.0001 . Data are represented as mean with SD. See also Figure S2. 
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Figure 3. SAA1 Promotes IL-17A Production in Polarized Th17 Cells In Vitro 

(A) Schematic representation of the experimentai protocoi. CFSE-iabeied naive CD4^ T ceiis from WT mice were activated under non-poiarizing (ThO: anti-CD3 
and anti-CD28) or Th17 poiarization condition (Th17: anti-CD3, anti-CD28, iL-6, and TGF-p). At 24 hr, RORyC T ceiis were further cuitured with recombinant 
mouse SAA1 (rmSAAl) under Th17 poiarization conditions for 48 hr. 

(B) iL-17A and IL-17F production by in vitro-poiarized mouse Th17 ceiis (CD4'^ RORyC) at 48 hr after rmSAAl (10 |xg/mi) addition. Ceiis were re-stimuiated with 
PMA/ionomycin for 5 hr, and cytokine production and CFSE diiution were monitored by FACS anaiysis. 

(legend continued on next page) 
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SFB-Induced SAA1/2 Requires IL-22 Expression and 
Epithelial StatS Signaling 

We next sought to understand the upstream signaling pathway 
that regulates ileal SAA induction in response to SFB coloniza- 
tion. SFB-free C57BL/6 mice were co-housed with SFB-positive 
mice (Taconic) for 2 weeks before epithelial-specific translating 
RNAs were assessed. RNA-seq profiling of polysome-associ- 
ated mRNAs pulled-down from SFB-colonized TRAP Villin-Cre 
mice revealed a significant enrichment of actively translated 
genes that are known to be regulated by the transcription factor 
stats (Figure S4A). To corroborate this analysis, we used immu- 
nofluorescence to probe for StatS activity (pStatS) in small intes- 
tinal tissue sections of mice acutely colonized with SFB® feces. 
We detected pStatS within 24 hr of SFB colonization (Figure 4A). 
Strikingly, SFB® fecal-induced upregulation of SAA1/2 was abro- 
gated almost entirely in mice with intestinal epithelial cell-spe- 
cific inactivation of Stat3 (Figure 4B). Stat3 activation is elicited 
by multiple cytokines, including IL-10 and members of the IL- 
20 subfamily, of which IL-22 is the best characterized (Rutz 
et al., 2014). Interestingly, we detected a significant increase in 
IL-22 mRNA within the terminal ileum shortly after colonization 
with SFB®, and this preceded the induction of epithelial SAA1/2 
expression by one day (Figures 4C-4E). This result suggested 
that IL-22 activated StatS, which then induced SAA1/2 expres- 
sion. Indeed, treatment with a neutralizing anti-IL-22 antibody 
prevented SAA1/2 induction following SFB colonization (Fig- 
ure S4B). Consistent with this result, SAA1/2 mRNA induction 
by SFB was almost completely abrogated in IL-22 deficient 
mice (Figure 4F). In addition, IL-17A production among RORyt^ 
Th17 cells in ileum was partially blocked in animals receiving 
IL-22 blocking antibodies, similar to what was observed in 
SAA1/2 DKO mice (Figure S4C). 

To evaluate whether IL-22 can directly induce epithelial SAA1/ 
2 expression, we generated small intestinal epithelial (SIE) orga- 
noids as previously described (Sato and Clevers, 2013) and 
exposed them to recombinant murine IL-22 (rlL-22). The addition 
of rlL-22 to SIE organoid cultures induced SAA1/2 mRNA in a 
dose-dependent fashion within 2 hr (Figure 4G). Coupled with 
the in vivo results, these findings indicate that SFB colonization 
initiates an IL-22-mediated StatS signaling cascade in epithelial 
cells, resulting in localized SAA1/2 expression. 

SFB Stimulates IL-23R-Dependent IL-22 Production by 
Activated ILC3 

In the intestine, IL-22 is produced by RORyt-expressing 
lymphoid cells that include ILC3, Th17, and y617 cells (Sawa 
et al., 2011). To investigate the source of IL-22 that directs 



epithelial SAA production following SFB colonization, we 
compared RNA expression profiles from small intestine Th17 
cells and ILC3 in mice stably colonized with SFB. Based on 
normalized DESeq reads, IL-22 expression was 10-fold higher 
in ILC3s than in Th17 cells (Figure S5A). Examination of these 
data sets also revealed a surface antigen, killer cell lectin-like re- 
ceptor subfamily B member 1 B (Kirbib), which was uniquely ex- 
pressed in ILC3. We confirmed this finding at the level of protein 
by flow cytometry (Figure S5B), and subsequently incorporated 
Kirbib into the fluorescent cell-labeling panel to dispense with 
the necessity of a reporter in studying the dynamic behavior of 
ILC3 upon SFB® colonization. 

At day 4 of SFB® colonization, up to 80% of ILC3s expressed 
the activation marker stem cell antigen-1 (Sca-1), which 
amounted to a 10-fold increase compared to the proportion of 
Sca-1 ^ ILC3s in mice gavaged with SFB-negative feces (Figures 
5A and 5B). Unlike CCR6“ ILC3s, CCR6'" ILC3s typically popu- 
late the isolated lymphoid follicles and cryptopatches (Lugering 
et al., 2010). Notably, both CCR6“ and CCR6^ ILC3s expressed 
high level of Sca-1 on their surface in response to SFB coloniza- 
tion. Although ILC3s increased in size in response to SFB colo- 
nization, their absolute numbers remained unchanged (Figures 
S5C-S5E). Intracellular cytokine staining also revealed a striking 
increase in both the frequency and total number of ILC3s pro- 
ducing IL-22 (Figure 5C). We recapitulated these findings using 
the fecal contents from SFB mono-associated mice, supporting 
the notion that SFB alone is sufficient to direct the localized ILC3 
response (Figure S5F). Importantly, at this early time point, other 
RORyt-dependent lineages, including y617 and (non-SFB-spe- 
cific) Th17 cells, did not produce IL-22 (Figure 5D). Furthermore, 
epithelial cells from Rag2 mutant mice, harboring ILC3s but lack- 
ing B and T cells, also expressed SAA1/2 in response to SFB 
colonization (Figure 5E). 

ILC3s residing in the colon secrete IL-22 in response to IL-23 
(Eken et al., 2014). To determine whether this mechanism also 
governs terminal ileal production of IL-22 by ILC3s upon SFB 
colonization, we colonized IL-23R-sufficient and -deficient 
mice with SFB® feces and examined the spontaneous release 
of IL-22 by ILC3s. Both the frequency and absolute number of 
IL-22-secreting CCR6^ and CCR6“ ILC3s declined sharply in 
IL-23R-deficient animals (Figures 6A and 6B). Consistent with 
this, induction of epithelial SAA1/2 mRNA was markedly blunted 
in IL-23R-deficient animals (Figure 6C). Altogether, these find- 
ings indicate that IL-23 signaling initiates IL-22-mediated cross- 
talk between ILC3s and small intestine epithelial cells, resulting 
in spatially restricted SAA production, which contributes to local 
augmentation of Th17 effector functions (Figure S6). 



(C) Representative data from polarized cells (48 hr) (four experiments total). The frequencies of IL-17A- and IL-17F-producing cells during each cell division are 
displayed using the gates represented in (B). Data are shown as mean with SD. 

(D) RNA-seq studies from two biological replicates comparing in vitro Th1 7 cells cultured in 20 ng/ml IL-6 and 0.1 ng/ml TGF-(3 with or without 2 |ag/ml rmSAAl for 
48 hr. Dots represent FPKMs of individual mRNA transcripts in the respective cell type (gray). 1 95 genes that were significantly different between vehicle and SAA 
treatment (p value <0.05, fold change >2) are highlighted in black. 

(E) Ingenuity Pathway Analysis for upstream regulators of the 195 SAA-affected genes in Th17 cells. 

(F) A ROR-responsive element (RORE) firefly luciferase reporter was used to monitor RORyt transcriptional activity in Th17 cells. The RORE-luciferase reporter 
construct was introduced by nucleofection into Th1 7 cells polarized for 48 hr and cultured with or without rmSAAl (2 |ig/ml), and luciferase activity was measured 
24 hr later. Data were normalized by renilla reporter activity. Results from four technical repeats in each condition were averaged. 

Results are representative of three independent experiments. *p < 0.05; **p < 0.01 ; ***p < 0.001 ; and ****p < 0.0001 . See also Figure S3. 
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Figure 4. Requirement for IL-22 Activation 
of Epithelial Stat3 in SFB-Dependent Induc- 
tion of SAA1/2 

(A) Immunohistochemistry analysis of phospho- 
StatS (green) in terminal ileum from SFB-colonized 
(SFBO or SFB free (JAX) mice on day 1 or day 3 
following gavage. EpCAM (red) and DAPI (blue) 
staining was used to identify the intestinal epithelial 
layer. The experiment was repeated two times with 
similar results. 

(B) qRT-PCR analysis of SAA1/2 expression in the 
epithelial fraction of terminal ileum from Villin-Cre 
mice sufficient or deficient for epithelial StatS at 
4 days after SFB colonization. Data were normal- 
ized using Gapdh. 

(C) Kinetics of SFB colonization post-oral gavage 
of SFB® or JAX fecal contents. SFB genomic 1 6S in 
terminal ileum was quantified by qPCR analysis. 
Copy number of the SFB genomic 16S was 
normalized to host //-23/' genomic DNA (JAX; n = 4. 
SFB®; n = 7). 

(D and E) Kinetics of IL-22 (D) and SAA1/2 (E) 
mRNA expression post-SFB® gavage. (D) Level of 
IL-22 mRNA from isolated ileal lamina propria cells 
was assessed (JAX; n = 7. SFB®; n = 8 in each time 
points). (E) Level of SAA1/2 mRNA from epithelial 
fraction in ileum (JAX; n = 4. SFB®; n = 7). Graphs 
(C-E) show accumulated data from two or three 
independent experiments as mean with SD. Day 
0 samples were from animals not gavaged with 
additional fecal samples. 

(F) SAA1/2 mRNA expression, determined by 
qRT-PCR, in epithelial fraction of terminal ileum 
from IL-22 sufficient or deficient mice at 4 days 
after SFB gavage (JAX gavaged: n = 10, SFB® 
gavaged IL-22^^'^ and n = 5, SFB® gavaged 
IL-22-/-: n = 5). 

(G) SAA1/2 mRNA expression in small intestinal 
epithelial cell organoid cultures following rlL-22 
treatment. 

All qRT-PCR experiments were repeated two times 
with similar results, and data were normalized 
to GAPDH mRNA. Each dot represents a single 
mouse, and the orange bar indicates average 
relative gene expression of each group. N.S., 
not significant (p > 0.05); *p < 0.05; **p < 0.01; 
***p < 0.001 ; and ****p < 0.0001 . See also Figure S4. 



DISCUSSION 

In this report, we describe a two-step process for the functional 
differentiation of intestinal Th17 cells following colonization of 
mice with SFB. The first step is the priming and polarization of 
antigen-specific CD4‘^ T cells in the tissue-draining lymph nodes, 
resulting in a poised state marked by the expression of RORyt, 
and the second is the activation of a cytokine gene expression 
program in a tissue microenvironment in which epithelial cell- 
derived factors SAA1 and SAA2 act on poised cells. Temporally, 



Recombinant miL-22 (ng/mi) observed Specific SFB Colonization of 

the ileum by 24 hr post-SFB gavage. 
Within 5 days, SFB antigen-presentation 
occurred in the MLN, where a significant 
number of RORyf^ Th17 cells were first observed. By day 7, 
RORyf^ Th17 cells accumulated in the lamina propria of the in- 
testine. During the early phase of this commensal-induced 
host response, IL-22 and SAA1/2 mRNA were acutely upregu- 
lated in the ileum within 3 days. Our studies revealed that acti- 
vation of epithelial SAAs involved an SFB-initiated signaling 
pathway requiring IL-23 and ILC3-derived IL-22 (Figure S6). 
A similar IL-22-dependent pathway was recently described in 
SFB induction of Fut2, an enzyme that catalyzes fucosylation 
of glycans, thus providing nutritional substrates for commensals 



8 Cell 163, 1-13, October 8, 2015 ©2015 Elsevier Inc. 





Please cite this article in press as: Sane et al., An IL-23R/IL-22 Circuit Regulates Epithelial Serum Amyloid A to Promote Local Effector Th17 
Responses, Cell (2015), http://dx.doi.Org/10.1016/j.cell.2015.08.061 



Cell 




o> 

> 



o o 

^ d 

A 

iS ^ 

CO ■“ 



100 

80 

60 

40 

20 

0 




CCR6^ CCR6- 

ILC3 ILC3 




D 



CCR6+ CCR6- 





0 10^ 10^ 10® 0 10^ 10® 10"^ 10® 




IL-22 



Th17 




E 




Figure 5. SFB Colonization Induces IL-22 Production by ILCSs 

(A) Cell-surface expression of Sca-1 and CCR6 on ileal SILP ILCSs from mice at 4 days after oral gavage with JAX or SFB® fecal contents. 

(B) Frequency of Sca-1 -positive CCR6^ or CCR6“ ILCSs in the terminal ileum from JAX- or SFB®-gavaged mice at 4 days after gavage. Data are represented as 
mean with SD (JAX gavaged: n = 5, SFB® gavaged: n = 6). 

(C) IL-22 production in ileal SILP ILCS. Ileal LP cells from JAX- or SFB®-gavaged mice were isolated at 4 days after gavage and cultured in vitro with GolgiStop 
(without PMA/lonomycin re-stimulation) for 4 hr. IL-22-producing ILCSs were analyzed by FACS, and cell numbers were calculated (right panel). Bar graph shows 
accumulated data as mean with SD (JAX gavaged: n = 5, SFB® gavaged: n = 6). 

(D) IL-22 production in each indicated cell type isolated from the ileal LP of SFB-free (JAX; green line) and SFB-colonized (SFB®; magenta line) mice, analyzed at 
4 days after gavage. (A-D) The experiment was repeated at least three times with similar results. 

(legend continued on next page) 
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Figure 6. IL-23R Requirement for SFB- 
Induced Expression of IL-22 and SAA1/2 in 
ILC3 and Epithelial Cells, Respectively 

(A) IL-22 production in CCR6^ and CCR6“ iieai LP 
iLC3s from //-23/'-sufficient or -deficient mice. 

(B) Ceii number of IL-22-producing CCR6“ (ieft) or 
CCR6^ (right) iieai LP iLC3s in JAX- or SFB^-gav- 
aged mice, at 4 days after gavage. 

(C) Bar graphs show accumuiated data from two 
independent experiments with a totai of ten SFB- 
free (JAX) and five SFB-coionized (SFB®) mice of 
each genotype, shown as mean with SD (C) SAA1/ 
2 expression, anaiyzed by qRT-PCR, in the 
epitheiiai fraction from terminai iieum of indicated 
mice (JAX gavaged; n = 9, SFBe gavaged U-23f^'^\ 
n = 8, SFBe gavaged ll-23r^'^\ n = 14). (dot) This 
experiment was repeated twice with simiiar re- 
suits. Each dot represents a singie mouse, and the 
orange bar indicates average reiative gene 
expression of each group. 

*p < 0.05; **p < 0.01; ***p < 0.001; and ****p < 
0 . 0001 . 
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and contributing to barrier protection (Bry et al., 1 996; Goto et al., 
2014; Pickard et al., 2014). 

Although we identified a requirement for the ILC3-dependent 
pathway during the early induction of epithelial SAA1/2, we 
cannot rule out that SFB attachment to the epithelial cells, which 
is accompanied by reorganization of the cortical actin network, 
contributes to or is required for SAA induction. Indeed, Atarashi 
et al. (201 5 [this issue of Cell]) showed that rat SFB failed to atta- 
ch to ileal epithelium and to induce SAA1/2 in mice, yet induced 
ILC3 IL-22 production. Thus, multiple signals may be required for 
epithelial cell induction of SAA1/2 or, alternatively, a single IL-22 
signal may be sufficient, but its delivery to epithelial receptors 
may be compromised in mice colonized with rat SFB. 

SAAs in Homeostasis and Autoimmune Diseases 

SFB is considered a commensal microbe, as there is no evi- 
dence that it induces pathology despite its adhesion to the 
epithelial surface and its induction of host adaptive and innate 
immune responses. By virtue of its activation of epithelial 
Stat3, SFB may reinforce the terminal ileal epithelial firewall, 
which harbors the greatest bacterial load in the small intestine 
(Sekirov et al., 2010). In the large intestine, Stat3 was found to 
mediate protection from dextran sodium sulfate (DSS)-induced 
colon injury in an IL-22 dependent manner. In this model, SAA in- 
duction was similarly dependent on Stat3 and required for 
dampening local inflammation (Eckhardt et al., 2010; Neufert 
et al., 2010). Recently, Hooper and colleagues showed that 



SAAs bind with strong affinity to vitamin 
A derivatives (Martineau et al., 2015), 
which promote mucosal homeostasis 
(Hall et al., 2011). Therefore, in the context of SFB colonization, 
the elevation of SAA1/2 and Th1 7 cells may be considered a ho- 
meostatic mechanism that contributes to barrier integrity in the 
small intestine. 

SAAs are significantly upregulated in the joints and serum of 
rheumatoid arthritis patients (O’Hara et al., 2000) and are corre- 
lated with disease progression (Chambers et al., 1983). In mouse 
models, Th1 7 cells are also known to contribute to multiple auto- 
immune diseases (Genovese et al., 201 0; Huh et al., 201 1 ; Lep- 
pkes et al., 2009), and whether SAAs or differences in its binding 
of cofactors mediate these processes remain to be elucidated. 
As microbiota-specific helper T cells, including SFB-specific 
Th17 cells, circulate widely following their polarization in the 
gut-associated lymphoid tissues (Hand et al., 2012; Yang 
et al., 201 4), it is possible that they contribute to autoimmune dis- 
ease systemically through induction of their effector functions at 
sites where inflammatory mediators such as SAAs are elevated. 

Another possibility is that under normal circumstances SAAs 
regulate the resolution of inflammatory responses. Th17 cells 
often produce IFNy in addition to IL-17A at sites of inflammation 
(Ivanov et al., 2006), and this is thought to contribute to pathoge- 
nicity, but they can also give rise to IL-10-producing cells that 
may be important during the resolution of inflammation (Gagliani 
et al., 2015). Therefore, it will be important to study in detail the 
regulation of SAA expression in tissues during the course of 
autoimmune progression and whether their manipulation affects 
Th1 7-mediated pathology. Importantly, since we did not observe 



(E) SAA1/2 expression, analyzed by qRT-PCR, in the epithelial fraction from terminal ileum of Rag2-sufficient or -deficient mice, at 3 days after SFB (SFB®) 
or control (JAX) gavage. Data are from two independent experiments. Each symbol represents a single mouse, and the orange bar indicates average of relative 
gene expression in each group. Experiments were repeated two times with similar results and data were normalized by GAPDH mRNA and combined. *p < 0.05; 
**p < 0.01 ; ***p < 0.001 ; and ****p < 0.0001 . 

See also Figure S5. 
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a complete loss in Th1 7 cytokine production in SAA1 /2-deficient 
settings during SFB induction, we speculate that other epithelial 
cell-derived factors likely contribute to the regulation of the Th1 7 
effector program, which will be of great interest to study. 

Innate Lymphoid Cell Regulation of Barrier Tissue 
Immunity 

Tissue-resident RORyt-expressing innate lymphoid cells and 
“innate-like” T cells, e.g., y617 cells, can respond rapidly to 
cytokine stimulation, including IL-1p and IL-23, by producing 
their own cytokines, particularly IL-22. This occurs in the 
absence of antigen receptor stimulation in y617 cells (Cua and 
Tato, 2010; Martin et al., 2009; Sutton et al., 2009). For ILCSs, 
microbiota-induced local production of IL-23, IL-1p, and TL1a/ 
TNFSF15 by CXsCRI"^ mononuclear phagocytes is required for 
their production of IL-22 and intestinal barrier protection func- 
tions (Longman et al., 2014). We do not know whether RORyt^ 
SFB-specific T cells that migrate to the intestinal lamina propria 
require TCR signaling for induction of effector cytokines, or 
whether SAA1/2, in the appropriate microenvironment, suffices 
for this purpose. It will be important, in future studies, to identify 
the receptor(s) for SAA1/2 on Th17 cells and characterize the 
signaling mechanism required for the cytokine response. 

Our findings suggest that SAA1 and SAA2 serve as sentinels 
for the tissue microenvironment, enlisting T helper cells with 
poised Th17 functions to protect or heal the epithelial barrier. It 
will be important to determine whether the SAAs are unique in 
their ability to amplify the Th17 cell transcriptional program and 
whether other induced mediators similarly influence other func- 
tional T cell subsets. In this regard, IL-1 8 and IL-33 were recently 
shown to amplify Th1 effector responses during infection with 
Salmonella or Chlamydia (O’Donnell et al., 2014). Efforts to 
dissect the upstream pathways and secretory outputs that 
impinge on T cell effector functions at steady state and in the 
face of multiple challenges will create novel opportunities to 
modulate tissue immunity. 

A critical remaining question is how SFB are sensed within the 
ileum. Atarashi et al. (2015) showed that adhesion of microbes 
to epithelium is a necessary step for Th17 differentiation. It will 
be important to learn whether distinct signals received by ileal 
epithelial cells from bound SFB are critical for Th1 7 cell induction 
and subsequent cytokine production. Characterization of such 
signals may provide insights into how intestinal microbes 
contribute to local and systemic inflammation. 

EXPERIMENTAL PROCEDURES 

Mouse Strains and SFB 

C57BL/6 mice were obtained from laconic Farms or the Jackson Labora- 
tory. Aii transgenic animais were bred and maintained in the animai faciiity 
of the Skirbaii Institute (NYU School of Medicine) in specific-pathogen- 
free (SPF) conditions. //-23/^^'° mice (Awasthi et al., 2009) were provided 
by M. Oukka and maintained on Jackson flora. 11-22 mutant mice were pro- 
vided by Pfizer. Saa1l2 double-knockout mice were previously described 
(Eckhardt et al., 2010) and were backcrossed for eight generations onto 
the C57BL/6 background. SFB-specific Th17-TCRTg (7B8) mice were previ- 
ously described (Yang et al., 2014) and maintained on the reporter 

(JAX; C57BL/6-ll17atm1Bcgen/J) and Ly5.1 background (JAX; B6. SJL- 
Ptprea Pepeb/BoyJ). EF1a: Lox-stop-lox-GFP-L10 mice were previously 
described (Stanley et al., 2013) and bred to Cd4 Cre (laconic), Cd11c Cre 



(JAX), Villin Cre (JAX), or Albumin Cre (JAX) mice. mice (JAX, 

B6.129S1 Sfaf3tm1Xyfu/J) were bred to Villin-Cre mice. Six- to 18-week- 
old animals were used for experiments. Fecal pellets were provided by 
Y. Umesaki (Yakult) to establish a colony of SFB mono-colonized mice in 
our gnotobiotic animal facility. All transgenic animals were maintained in 
the animal facility of the Skirbaii Institute in accordance with protocols 
approved by the Institutional Animal Care and Use Committee of the NYU 
School of Medicine. 

Acute SFB Colonization 

Fecal pellets were collected from SFB-enriched H-23rRag2 DKO mice (SFB®), 
SFB mono-associated mice (pure SFB), or SFB-free JAX B6 mice (JAX), 
respectively. Fresh fecal pellets were homogenized through a 100-|am filter, 
pelleted at 3,400 rpm for 10 min, and re-suspended in PBS. Each animal 
was administered one-quarter pellet by oral gavage. 

Statistics 

Two-tailed unpaired Student’s t tests were performed to compare the results 
using Excel and Prism. We treated less than 0.05 of p value as significant dif- 
ferences. *p < 0.05, **p < 0.01 , ***p < 0.001 , and ****p < 0.0001 . All experiments 
were performed at least twice. 
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SUMMARY 

Epithelial cells are normally stably anchored, main- 
taining their relative positions and association with 
the basement membrane. Developmental rearrange- 
ments occur through cell intercalation, and cells can 
delaminate during epithelial-mesenchymal transi- 
tions and metastasis. We mapped the formation of 
lung neuroepithelial bodies (NEBs), innervated clus- 
ters of neuroendocrine/neurosensory cells within 
the bronchial epithelium, revealing a targeted mode 
of cell migration that we named “slithering,” in which 
cells transiently lose epithelial character but remain 
associated with the membrane while traversing 
neighboring epithelial cells to reach cluster sites. 
Immunostaining, lineage tracing, clonal analysis, 
and live imaging showed that NEB progenitors, 
initially distributed randomly, downregulate adhe- 
sion and polarity proteins, crawling over and be- 
tween neighboring cells to converge at diametrically 
opposed positions at bronchial branchpoints, where 
they reestablish epithelial structure and express 
neuroendocrine genes. There is little accompanying 
progenitor proliferation or apoptosis. Activation 
of the slithering program may explain why lung can- 
cers arising from neuroendocrine cells are highly 
metastatic. 

INTRODUCTION 

Epithelia are sheets of cells that line and protect the body and in- 
ternal organs, and the polarized cells that comprise them play 
important roles in absorption, secretion, and sensation. Epithe- 
lial cells are normally tightly attached to one another through 
specialized junctions and adhesion proteins along their lateral 
surface and anchored to the basement membrane at their basal 
surface. Although epithelial sheets can grow and change shape, 
the constituent cells typically maintain their relative positions. 
When cells in an epithelial monolayer have been found to rear- 
range, as in Drosophila germband elongation (Irvine and Wie- 
schaus, 1994) and wing morphogenesis (Aigouy et al., 2010), 



they do so conservatively by cell intercalation, in which cells 
shrink lateral junctions with some neighboring cells while ex- 
panding lateral junctions with others, exchanging positions while 
maintaining their polarized structure and the integrity of the 
epithelium (Bertet et al., 2004; Blankenship et al., 2006; Guillot 
and Lecuit, 2013). Here, we describe a very different mode of 
epithelial cell rearrangement that results in homotypic sorting 
(Krens and Heisenberg, 2011) of a specialized cell type, discov- 
ered in our dissection of pulmonary neuroendocrine (NE) cell 
development in mice. 

Pulmonary NE cells are one of the most interesting but least 
understood cell types in the lung. They are distributed 
throughout the bronchial epithelium, interspersed among secre- 
tory club (Clara) cells and ciliated cells, the two major airway 
epithelial cell types (Rock and Hogan, 2011). Like other neuroen- 
docrine cells in the body, they were originally identified by their 
secretory dense-core vesicles (Feyrter, 1954) that contain 
signaling molecules and bioactive peptides, including serotonin 
and calcitonin gene-related peptide (CGRP). Although some pul- 
monary NE cells are distributed randomly in the airway epithe- 
lium, others are organized into clusters called neuroendocrine 
or neuroepithelial bodies (NEBs) that are highly innervated 
(Brouns et al., 2009; Lauweryns and Peuskens, 1972), forming 
synaptic contacts with afferent and efferent nerve fibers (Lau- 
weryns and Van Lommel, 1987). NE cells can be activated by a 
variety of stimuli and are thought to monitor diverse aspects of 
lung physiology including oxygen, chemical, and mechanical 
changes (Cutz et al., 2013). In addition to these sensory and 
neurosecretory functions, NE cells have a stem cell function 
that helps replenish the bronchial epithelium following severe 
injury (Guha et al., 2012; Reynolds et al., 2000; Song et al., 
2012). They are also the initiating cells of small cell lung cancer 
(Park et al., 2011; Song et al., 2012; Sutherland et al., 2011), a 
highly metastatic and the most deadly form of lung cancer (van 
Meerbeeck et al., 201 1). Excess or altered distribution of NE cells 
are also found in a variety of serious, but poorly understood, lung 
diseases including sudden infant death syndrome (SIDS) (Cutz 
et al., 2007), bronchopulmonary dysplasia (BPD) (Gillan and 
Cutz, 1993), and neuroendocrine hyperplasia (Aguayo et al., 
1992; Deterding et al., 2005). 

To provide a foundation for a genetic dissection of the devel- 
opment, function, and diseases of pulmonary NE cells, we first 
mapped their locations in mice and found that NEBs are located 
at stereotyped positions. We then probed NEB development by 
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Figure 1. Mapping and Lineage Tracing of 
Pulmonary Neuroendocrine Cells 

(A-E) Confocal images of NE cells in bronchial 
epithelium of embryonic day (E) 18 (A-D) or 
adult (postnatal 2 month; E) 
f?osa26^^®^®®”^^mouse induced with tamoxifen at 
E14 to label NE cells with ZsGreen (green). See 
also Figure SI. (A and B) Solitary NE cells with 
pyramidal (A) and slender (B) morphologies. 
Arrowhead, thin projection to lumen. (C) Mini- 
cluster of four NE cells. (D) Neuroepithelial body 
(NEB), a large cluster of -^25 NE cells (12 visible in 
optical plane). (E) Adult NEB. Cells are more uni- 
formly columnar. Note non-NE cells (nuclei 
marked by dots) superficial to NE cells in NEBs 
(D and E). Sample was also stained with PGP9.5 
(red) to confirm NE identity and show innervating 
fibers (arrowhead). Dotted lines, individual NE cells 
in clusters; dashed lines, basement membrane 
location determined by co-staining with E-cad- 
herin or laminin y1- Scale bar, 10 ^im. 

(F-l) Stereotyped locations of NEBs at airway 
branchpoints. (F) Left primary bronchus (L) and 
some daughter (L.L1, L.L2, ...) and granddaughter 
(L.L1 .A1 , L.L1 .A2,...) branches showing positions of 
NEBs at E18. (G) Low magnification confocal image 
of left primary bronchus of E18 

mouse. Note pairs of NEBs (green) 
at indicated branch points (L.L2, L.M3), with one 
NEB located proximal (p) and the other distal (d) 
along bronchial tree. Scale bar, 100 ^im. (H) Longi- 
tudinal optical section through base of L.L2 showing 
L.L2p and L.L2d NEBs. Close-ups of NEBs (boxed) 
are shown at right. Scale bar, 5 i^m. (I) Schematic 
longitudinal section (LS) and cross section p<S) 
showing diametrically opposite positions of L.L2 
NEBs at branch base. See also Figure S2. 

(J-M) Lineage tracing of NE cell origins. Confocal images of NEBs (J and L) and solitary NE cells (K and M) at E18 from Shh^''^^^;Rosa26'^^^^^^ mouse to label 
bronchial epithelial lineage (J and K) or from Wnt1 -Cre;Rosa26^^’^''^^'^ mouse to label neural crest lineage (L and M). Lungs were co-stained for lineage tag 
(green) and NE marker CGRP (magenta). All NEB and solitary NE cells express bronchial trace and none express neural crest trace. Other epithelial cells also 
express bronchial trace, and sub-epithelial neurons (arrowheads) express Wnt1 trace. Dots, outline of NEBs or individual NE cells; dashes, basement 
membrane. Scale bar, 10 ^im. 




immunostaining, lineage tracing, and imaging of developmental 
intermediates and found that although progenitors are initially 
distributed randomly throughout the epithelium, they rapidly 
resolve into clusters. We show that clusters do not form by pro- 
genitor proliferation, but by a targeted mechanism of epithelial 
cell rearrangement in which progenitors transiently lose epithelial 
character as they “slither” over and around neighboring cells 
and converge at cluster sites. 

RESULTS 

Mapping the Origin and Distribution of NEBs 

Pulmonary NE cells are distributed sparsely throughout the bron- 
chial epithelium both as solitary cells and clusters. Most mature 
NE cells are typical columnar epithelial cells, however, some 
have distinct morphologies such as short “pyramidal” cells 
that do not reach the surface or slender cells with a thin luminal 
projection (Figures 1 A and 1 B). Clusters are either small, typically 
with two to five NE cells, which we call “mini-clusters” (Fig- 
ure 1C), or are larger clusters typically containing 20-30 NE cells 



(Figures 1 D and 1 E). The terms “neuroepithelial body” (NEB) and 
“neuroendocrine body” often refer to all NE cell clusters, but 
here, we reserve the term NEB for large clusters. NE clusters in 
other species are innervated (Lauweryns and Peuskens, 1972; 
Scheuermann, 1987), and we found by immunostaining for neu- 
rites with PGP9.5 (Uchl1 ) that mouse NEBs are extensively inner- 
vated and mini-clusters show some innervation, whereas solitary 
NE cells do not appear to be innervated. 

We mapped the locations of NE cells in the mouse bronchial 
tree in late stage embryonic lungs (embryonic days E1 6-E1 8) us- 
ing the airway branch lineage (Metzger et al., 2008). NE cells 
were detected by immunostaining for the proneural bHLH tran- 
scription factor AscH (Mash1) or cytoplasmic NE marker 
PGP9.5, or using an lineage trace, which within the 

bronchial epithelium labels exclusively the NE lineage (Figure S1 ). 
We analyzed serial sections of left lung lobes co-stained for 
E-cadherin to visualize airway epithelium. Solitary NE cells and 
mini-clusters were scattered throughout the bronchial epithelium 
in no obvious pattern. The pattern of NEBs, however, was highly 
stereotyped. NEBs localized exclusively to airway branchpoints 
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Figure 2. Cellular Events in NEB Formation 

Longitudinal optical sections and schematics of base of L.L2 bronchus at stages indicated in wild-type lungs stained for NE progenitor marker AscH (A-E) or 
mature marker PGP9.5 (F). Dashed lines, basement membrane. Three phases of NEB development are indicated: NE cell selection in bronchial epithelium, as 
indicated by expression of Ascii (open circles); NE cell cluster formation at NEB sites; and differentiation marked by expression of PGP9.5 (filled circles) and 
CGRP (see Figure S3F). Timing of Ascii , PGP9.5, and CGRP expression is indicated below. PGP9. 5-positive nerve fibers (arrowhead, F) extend toward and begin 
to ramify on L.L2 NEB at E16.5. Scale bar, 25 |im. See also Figure S3. 



(Figures 1F and 1G). Although none were found at the origin of 
the trachea or right and left primary bronchi, at every branchpoint 
examined beyond this a pair of NEBs was present (Figures 1F, 
1G, and S2). NEBs were located at diametrically opposite posi- 
tions at branch junctions, with one at the most proximal position 
and the other the most distal (Figures 1 H and 1 1). Hence, we 
designate each NEB by the name of the branch it originates, 
plus “p” or “d” to distinguish proximal and distal partners. The 
proximal NEB marks the most obtuse angle of the branch junc- 
tion, whereas the distal NEB lies at the most acute angle. The 
same pattern was observed at every branchpoint examined in 
other lobes (Figure S2) (n = 5 lungs, 70 branchpoints). Thus, 
NEBs form at stereotyped, diametrically opposite positions at 
secondary airway branchpoints and beyond, implying there is 
a localized signal directing NEB formation at each of these sites. 

NE cells in other organs arise either from invading neural crest 
cells that colonize the organ or by differentiation of resident 
epithelial cells (Anderson and Axel, 1986; Gu et al., 2002). Initial 
lineage trace studies indicated that at least some pulmonary NE 
cells arise from the epithelium (Song et al., 201 2). To determine if 
NEBs arise from the epithelium, we used a Sonic hedgehog^^^ 
(Shh^''^) knock-in allele (Harris et al., 2006) and a Cre recombi- 
nase reporter to label and lineage trace developing airway 
epithelial progenitors several days before NE progenitors are de- 
tected. At E18.5, all NEBs as well as mini-clusters and solitary 
pulmonary NE cells expressed the Shh^''^ lineage trace 
(n = 500 NE cells scored) (Figures 1 J and 1 K). In a similar exper- 
iment using a Wnt1-Cre transgene to lineage trace developing 
neural crest cells, no NE cells at E18.5 expressed the lineage 
trace (n = 500) (Figure 1L). We conclude that all pulmonary NE 
cells, including NEBs, arise from the Shh epithelial lineage, not 
from neural crest. Thus, both solitary and clustered pulmonary 
NE cells share the same origin as other airway epithelial cells. 

Three Phases of NEB Development 

To elucidate the cellular events and timing of NEB formation, we 
mapped the development of specific NEBs beginning at E12, 



before NE progenitors are detected, through E16 when NEBs 
achieve their characteristic structure. We used AscH , the earliest 
known marker of NE cells (Ito et al., 2000), to visualize progeni- 
tors and early events and PGP9.5 and CGRP, canonical markers 
of mature NE cells (Polak et al., 1993), to visualize later steps. We 
analyzed five serially sectioned wild-type CD-1 lungs at each of 
six stages between E12-E16, initially focusing on the base of 
branch L.L2 where NEBs L.L2p and L.L2d form (Figure 2). The 
earliest NE progenitors at these sites were detected at late 
E12.5-E13, when a few scattered cells in the epithelium began 
to express AscH (Figure 2A). At E13, more densely distributed 
but still solitary NE progenitors were present in a “salt and 
pepper” pattern (Figure 2B). By late E13.5-E14, small groups 
of NE progenitors were detected at L.L2p and L.L2d and also 
scattered around the region (Figure 2C). By E14.5, within 24- 
48 hr of detection of the first NE progenitors, numerous small 
clusters of NE cells began to appear throughout the region, inter- 
spersed with solitary cells (Figures 2D and S3B). At E15.5, large 
clusters of NE cells were detected, but exclusively at the posi- 
tions of NEBs L.L2p and L.L2d (Figure 2E). By E15.5-E16, NE 
cells in NEBs L.L2p and L.L2d expressed PGP9.5 and CGRP, 
as did solitary NE cells and mini-clusters in the region (Figures 
2F, S3E, and S3F). This analysis subdivided NEB formation 
into three developmental phases: NE cell selection (E1 2.5- 
El 3.5), cluster formation (E13.5-E15.5), and differentiation 
(E15.5 onward). During differentiation, PGP9. 5-positive nerve fi- 
bers extend toward and begin to ramify on the NEB (Figures 2F 
and S3G-S3I). 

The same series of cellular events and developmental phases 
was observed at other positions where NEB formation was map- 
ped, including L.L3p/L.L3d, L.L4p/L.L4d, and L.L5p/L.L5d, all 
located at the base of secondary branches along the left primary 
bronchus. However, the process initiates slightly later at the 
more distal positions, delayed by ~0.5 day for L.L3, 1 day for 
L.L4, and 1 .5 days for L.L5 NEBs. This is best appreciated in im- 
ages capturing all four positions in the same lung, when the most 
distal branch (L.L5) has not yet initiated NE development, L.L4 is 
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Figure 3. Analysis of Proliferation during NEB Formation 

(A) Close-up of NEB in Shh^''^^^;Rosa26'^'^'^^^ mouse lung in which entire 
bronchial epithelium was permanently labeled with one of the Rainbow re- 
porters (Orange, Cerulean, or Cherry) before NE selection then co-stained for 
CGRP (magenta) at E18 following NEB formation. NEB (dotted outline) is 
composed of cells labeled with each of the reporters, thus it did not arise by 
proliferation of a single progenitor. Epithelial cells neighboring the NEB are not 
shown in schematic; cells below basement membrane (dashed line) express 
the constitutive (expressed in absence of Cre) GFP reporter (pseudo-colored 
gray). Scale bar, 10 i^m. See also Table SI. 

(B) Close-up of a NEB in E1 8 Asc/ 7 mouse lung in which 
NE progenitors were sparsely labeled by injection of tamoxifen (4 mg) early 
in NE development (E1 1 .5) then immunostained for CGRP (magenta) 6 days 
later following NEB formation. Note one labeled cell (pseudocolored yellow) in 
NEB, indicating that labeled progenitor did not proliferate. Cells without Cre 
activity (“Unrecombined”) express the GFP reporter (pseudo-colored gray 
in middle panel); note absence of GFP in clone (yellow cell). Only cells in 
NEB (dotted lines) are shown in schematic. Scale bar, 10 |im. See also Tables 
S2A-S2C. 

(C) Confocal sagittal section through cluster of NE cells during NEB formation 
in E14 wild-type embryo in which EdU was injected 2 hr before harvesting 
to label dividing cells, then analyzed after co-staining for EdU (green) and 



undergoing NE cell selection, L.L3 has begun NE clustering, and 
L.L2 is in the NE differentiation phase (Figures S3A and S3B). 
Thus, NEBs form in three phases— NE selection, cluster forma- 
tion, and differentiation— that initiate in a proximal-to-distal 
sequence separated by approximately a half day between 
branches along the left primary bronchus. Despite differences 
in the timing of NEB program initiation along the proximal-distal 
axis, the differentiation phase begins in near synchrony for all 
NEBs and other NE cells (Figures S3G-S3I), suggesting that 
this final phase is triggered by a more global signal independent 
of earlier steps in the program. 

Cluster Formation Does Not Occur by Proliferation of NE 
Precursors 

NE and neurosensory cells in many tissues are arranged in clus- 
ters, but how clusters form is only understood for Drosophila and 
other insect sensory bristles. Each Drosophila bristle arises by 
proliferation of a single progenitor expressing Achaete-Scute 
proteins (founders of the AscH gene family), and the clonal cell 
cluster differentiates into bristle sensory and support cells (Cu- 
bes et al., 1991; Lawrence, 1966). To determine if NEBs form 
by clonal proliferation of an epithelial progenitor, we labeled all 
airway epithelial progenitors prior to NEB formation using Shh^''^ 
and the “Rainbow” (Rosa26'^^'^) Cre-reporter (Rinkevich et al., 
2011), which stochastically and permanently labels each pro- 
genitor and any of its daughter cells with one of three different 
fluorescent proteins. If NEBs form like Drosophila sensory bris- 
tles, each mature NEB should be monoclonal and hence 
composed of cells of a single Rainbow color. We found instead 
that all NEBs examined at E17.5-E18 (n > 100), including those 
examined closely by confocal microscopy to distinguish individ- 
ual NE cells, were multi-colored (Figure 3A; Table S1). A similar 
result obtained for NE miniclusters. Thus, NEBs and miniclusters 
arise from multiple founder cells, not clonal proliferation of a 
single NE progenitor. 

Although the above results rule out NEB formation by prolifer- 
ation of a single progenitor, they do not exclude the possibility 
that NEBs are polyclonal, arising by proliferation of multiple NE 
progenitors. We therefore carried out two types of experiments 
to investigate the role of progenitor proliferation in NEB forma- 
tion. We first used an inducible Cre recombinase (Cre-ER) ex- 
pressed from the endogenous AscH promoter (Asc/7^''®^^) 
(Kim et al., 2011), and an early (~E1 1 .5-E12) dose of the inducer 
tamoxifen, to sparsely label individual NE progenitors with one of 
the Rainbow colors. When mature NEBs and miniclusters were 
examined 6 days later, few clusters were labeled, and of those 
that were, most contained just a single labeled cell (30 of 39 
labeled clusters) (Figure 3B). In those with more than one labeled 



AscH (red). No NE cells, but many surrounding epithelial cells, are labeled 
with EdU. 

(D) Quantification of EdU-positive NE and non-NE airway epithelial cells at 
indicated ages. Values shown are percent ± SE for proportions (n = 134, 119, 
78, 130 [NE cells], n = 520, 322, 326, 930 [non-NE cells] at E13.5, 14.5, 15.5, 
16.5, respectively) scored in two embryos for each cell type scored at each 
age. Note few EdU-positive NE cells, indicating progenitors exit cell cycle at or 
just after selection. Similar results were obtained for phospho-histone H3 
analysis. Scale bar, 10 lam. 
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Figure 4. Local Clearing of NE Cell Precursors during Cluster 
Formation 

(A) Cross-sections through base of bronchial branch L.V2 at indicated ages of 
wild-type embryos immunostained for Ascii (white; green in schematics 
below) to show NE progenitors forming NEBs L.V2p and L.V2d. Progenitors 
are initially scattered in “salt and pepper” pattern but, as NEBs form, sur- 
rounding regions clear of NE cells. Scale bar, 10 |im. 

(B) Sagittal section through developing NEB (dotted outline) in E14 lung co- 
stained for Ascii (red) and activated (cleaved) Caspase 3 (Casp3*, green) to 
detect apoptotic cells. Note Casp3*-positive cells near, but not within, NEB 
(dotted outline). Ascii signal below basement membrane (dashed line) at E13 
and E14 is non-specific background. Scale bar, 10 |am. 

(C) Quantification of Casp3*-positive NE and non-NE cells in bronchial 
epithelium at indicated stages. Data shown are percent ± SE for proportions; 
n = 589, 406, 322 (NE cells), 1 ,437, 1 ,534, 1 ,122 (non-NE cells) scored at E14.5, 
15.5, 16.5, respectively, from two embryos at each stage. 

cell, the cells were usually different colors, in the ratio expected 
for independent labeling events (Table S2). These clonal labeling 
results imply that there is little or no proliferation of NE progeni- 
tors once they express the progenitor marker AscH. A similar 
conclusion obtained by pulse-labeling with deoxyribonucleotide 
analog EdU from E12-E15 to detect dividing NE cells. Although 
EdU-labeled cells were readily detected in surrounding (non- 
neuroendocrine) epithelial cells and mesenchyme, few labeled 
NE cells were detected (Figures 3C and 3D). The only labeled 
NE cells (<1 % of NE cells) were solitary NE progenitors in the 
most distal regions of the bronchial tree just initiating AscH 
expression. We conclude that NE progenitors cease dividing at 
or shortly after initiation of AscH expression, and proliferation 
of progenitors does not contribute significantly to cluster 
formation. 

Local Clearing of NE Cell Precursors without Apoptosis 

Inspection of the airway epithelium immediately surrounding 
positions where NEBs form showed local “clearing” of NE pro- 
genitors: progenitors were initially evenly distributed in a salt 
and pepper pattern like other regions, but labeled cells were 
lost as NEBs formed nearby (Figure 4A). To determine if clearing 



was due to programmed cell death, we examined the regions 
for expression of the apoptosis marker, activated Caspase-3. 
Although activated Caspase-3 was detected in other developing 
cell types, little or none was detected in AscH -expressing NE 
progenitors (Figures 4B and 4C). Cell counts also showed no 
decline in NE progenitors during cluster formation (Figures S3C 
and S3D). Thus, apoptosis does not contribute to clearing of 
NE progenitors from regions surrounding NEB formation. 

Sparse Labeling Reveals a Series of Migratory NE 
Intermediates 

Local clearing of progenitors and declining number of solitary NE 
cells as the number of clustered cells and size of the clusters 
increased (Figures S3C and S3D) suggested that NEB formation 
might occur by local sorting of NE progenitors. To identify and 
visualize developmental intermediates, we sparsely labeled NE 
progenitors at E11.5 or E12.5 using AscH^''^^^ and limiting 
tamoxifen in conjunction with the Rosa26^^^’'^^'^ reporter that 
robustly labels NE progenitors, allowing visualization of the full 
structure of individual intermediates (Figures S4 and S5). Exam- 
ination of labeled progenitors 2-4 days later (Figure 5A) revealed 
a collection of developmental intermediates with striking migra- 
tory morphologies, including long cytoplasmic extensions, fibro- 
blast-like shapes oriented orthogonal to neighboring epithelial 
cells— some fully detached from the basement membrane— 
and cells spiraling around each other and converging at the 
basement membrane (Figures 5B-5F). NE cells with migratory 
structures were abundant in regions surrounding sites of NEB 
formation, and they were transient intermediates detected only 
during NEB formation (labeling between E12 and E13 but not 
at E14 or later) that “chased” into mature NEBs in the 36 hr after 
labeling. The migratory morphologies were specific to the NE 
lineage as other epithelial cells labeled during this period using 
FoxA2^''^^^ showed the typical structure of monolayer or pseu- 
dostratified epithelial cells (Figure S6). 

Analysis of over 180 labeled NE intermediates (Table S3) by 
confocal microscopy distinguished five morphologic classes. 
Class 1 cells were the earliest and simplest intermediates: 
solitary NE cell progenitors lacking extensions or unusual 
morphologic features (Figure 5B). They are early progenitors, 
presumably before migration has occurred, as they were the 
only labeled cells detected during the earliest pulse-chase inter- 
val (E1 1 .5-E1 3.5). Class 5 intermediates were similar in structure 
to Class 1 cells, each showing normal epithelial structure and 
orientation, except class 5 cells were found in clusters (Fig- 
ure 5F). Class 5 cells are the most advanced intermediates 
because they were the only type detected at E1 6.5 and later (Fig- 
ure 5G). The other three classes had migratory morphologies and 
were the most striking and unusual structures. Class 2 interme- 
diates are solitary and resemble class 1 intermediates with a 
largely normal epithelial morphology, except they have a long 
(up to 30 |im) cellular extension that reaches over or around 
neighboring epithelial cells, typically contacting another NE cell 
several cell diameters away (Figure 5C; Table S3). Class 3 inter- 
mediates have the most surprising structure, betraying their 
epithelial origin. They resembled fibroblasts and had protrusions 
and often twisted morphologies that curved over or around, and 
occasionally under, neighboring epithelial cells. Some had no 
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Figure 5. Sparse Labeling of Progenitors Reveals a Series of NEB Intermediates 

(A) Labeling scheme. NE progenitors were permanently labeled with ZsGreen by tamoxifen injection of Asc/ 7 mice at indicated ages (dots), 

then allowed to develop for times indicated (arrows). 

(B-F) Micrographs and schematics of five classes of NE intermediates (green) labeled as above and co-stained for E-cadherin to outline epithelial cells. 
A representative cell of each class is outlined (dots) in micrograph and highlighted in dark green in schematic; other labeled NE cells are shown in light green in 
schematic. Note long (>10 |im) apical cell extension (class 2), fibroblast-like morphology and no basement membrane contact (class 3), and thin cellular ex- 
tensions that converge at basement membrane (class 4). In late embryo (-^E17) through adult, all NE cells contact basement membrane (class 5). Scale bar, 
10 i^m. See also Figures S4, S5, and Table S3. 

(G) Quantification of different classes of NEB intermediates labeled as above and harvested 48 hr after tamoxifen injection at ages indicated. The changing 
abundance of intermediates supports their ordering in a developmental series, n, number of intermediates scored in two lungs at each stage. 

(H-K) Confocal images (sagittal sections; H-J) and quantification (K) of NEB intermediates labeled as above at base of bronchi L.L4, L.L3, and L.L2 in same lung. 
L.L4 is most distal (immature) and L.L2 is most proximal (mature) position. The different abundance of intermediates along proximal-distal axis supports their 
ordering in a developmental series, n, number of intermediates scored in two embryos. Scale bar, 10 |im. 

See also Figure S6. 



detectable contact with the basement membrane, or contacted 
it only via a thin cellular extension (Figure 5D). Class 4 intermedi- 
ates were associated with other NE cells, organized into nascent 
clusters with cell bodies localized apically and thin extensions 
swirling as they converged at a site on the basement membrane 
(Figure 5E). 

We inferred from the structures of the intermediates that they 
comprise a developmental series, beginning with class 1 and 
proceeding sequentially to class 5 (Figure 7A): NE progenitors 
first extend long processes and move toward their target at a 
branch junction, losing their epithelial structure and adopting a 
mesenchymal morphology as they leave their birth site. They 
then migrate to the target, converging there with other progeni- 
tors and re-establishing their epithelial structure and organizing 
into clusters. Similar intermediates were observed at sites of 
mini-cluster formation (Table S3), suggesting that miniclusters 
form by a related sorting process. 

The timing and location of intermediates at sites of NEB forma- 
tion support the proposed sequence. Quantification of the abun- 
dance of each type of intermediate at the same site (along the 
primary left lobe bronchus) but different times in NEB develop- 



ment (E1 3.5-E1 6.5; Figure 5G), or at different stages in the devel- 
opmental process (represented by different positions along the 
proximal-distal axis (L.L4, L.L3, L.L2) at the same time in devel- 
opment (E14) (Figures 5H-5K), supported the proposed order. 
Class 5 intermediates were also the only ones innervated, con- 
firming they are the most mature. Marker expression patterns 
provide further support for the proposed sequence, with expres- 
sion of progenitor marker AscH present in class 1 and 2 but pro- 
gressively declining in class 3, 4, and 5 cells, and expression of 
late NE markers initiating in class 4 (PGP9.5) and 5 (CGRP) cells 
(Figures 2, 7A, and 7B). 

Live Imaging of NE Migration during Cluster Formation 

To investigate dynamics of progenitor migration and sorting, 
we developed a slice culture preparation from transgenic 

embryonic mouse lungs with 
NE progenitors labeled with ZsGreen and membrane GFP 
(mGFP) and other cells labeled with mTomato. This allowed 
tracking by time-lapse confocal microscopy of individual NE pro- 
genitors for 2-3 days in the 175-|im thick lung slices as they 
moved toward and integrated into target sites. This showed 
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Figure 6. Live Imaging of Migrating NE Pro- 
genitors in Lung Slice Culture 

(A-L) NE cell migration and entry into NEB. 
Selected frames from 26 hr time-lapse confocal 
microscopy (Movie SI ) of migrating NE cells in El 5 
mouse lung slice culture from 

mouse induced with tamox- 
ifen at El 3 to label NE cells with ZsGreen and 
mGFP (green) and other cells with mTomato (red). 
NE cells 1-3 at developing NEB site are joined by 
cell 4 (red outline), which migrates into region 
(B and C; at -^1.6 |im/hr) and directly enters 
cluster (D). Cell 5 (blue outline) migrates into same 
region (D; -^1 .6 |am/hr) but soon changes direction 
and extends toward cell 4 (E). Cell 4 reciprocates 
(F) and the cells contact briefly (G, “kissing”) and 
retract (H), repeating kissing sequence four times 
in hr. Over the next 5 hr, cell 5 remains sta- 
tionary (I, “resting”) before extending backward 
(J, “extending away”) and then diving forward 
(K, “entry”) to join NEB cluster (L). 

(M) Timeline of above changes in cell 4 (red) and 5 
(blue). 

(N-U) NE cell crossing bronchial tube. Selected 
frames from another region of same culture. 
Initially, NE cell (marked with white asterisk) flat- 
tens along basement membrane (N, “flattening”) 
then re-orients toward lumen (0-Q, “extending 
out”). Over next ~14 hr, its contact with basement 
changes dynamically (R and S, “dynamic an- 
chors”), including formation of a second extension 
that contacts basement membrane >10 ^im away 
(S). Just before crossing, anchor is a very fine 
projection with “beaded” appearance (T) that 
fragments as cell finally crosses (-^2.5 |im/hr) into 
epithelium of opposing side of tube (crossing, T) 
to reach destination (U, arrival). Frame U was 
selected from a z plane 8 lam deeper than (N)-(T) to 
more clearly visualize cell after arrival. Dashed line, 
basement membrane. Black asterisk, stationary 
NE cell. Scale bars, 10 i^m. See also Movies SI 
and S2. 

(V) Timeline of above changes in NE cell marked 
with white asterisk. 



progenitors migrating toward targets at 1-3 |im/hr, dynamically 
extending processes and changing shape as they crawled over 
and around neighboring epithelial cells (Figures 6A-6M; Movie 
S1). Progenitors migrated tens of microns or more, traversing 
many neighboring epithelial cells or even crossing from one 
side of a developing bronchial tube to the other when the sides 
were apposed (Figures 6N-6V; Movie S2). Periodically, the 
migrating cells paused (“resting,” Figures 6I and 6M; Movie 
S1) or changed direction (Figure 6J, “extending away”; Movie 
S1 ), or displayed dynamic interactions with other NE cells (“kiss- 
ing,” Figures 6F-6H; Movie S1) or the basement membrane 
(“dynamic anchors,” Figures 6R and 6S; Movie S2). After the 
progenitor approached the target, this slow and halting migration 
phase was followed by direct and rapid (~5 |im/hr for 2 hr) entry 
into the developing cluster. Live imaging confirmed this mode of 
NE cell migration predicted by the in vivo intermediates, and it re- 
vealed dynamic “exploratory” behaviors and interactions with 



other NE cells and the basement membrane as migrating cells 
seek and ultimately converge at the target. 

Transient Downregulation of Epithelial Adhesion and 
Polarity in Migrating Progenitors 

The intermediates and movements imply that NE progenitors un- 
dergo changes in adhesion and polarity as they migrate and form 
a NEB. To begin to elucidate the underlying molecular events, we 
immunostained intermediates for adhesion and polarity markers 
(Figures 7A and 7C-7E). E-cadherin showed dynamic changes in 
expression during cluster formation (Figures 7A and 7C). The 
adhesion protein was expressed at normal levels in early progen- 
itors but dramatically downregulated in migrating class 3 inter- 
mediates. Downregulation was transient because E-cadherin 
was again detected in class 5 intermediates that reached their 
target and re-established full contact with the basal lamina, 
although even then E-cadherin levels remained below that of 
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Figure 7. NE Progenitors Transiently Express EMT Regulator Snail and Downregulate Adhesion and Polarity Markers 

(A) Expression of NE, adhesion (adhes), polarity, and EMT markers in NEB progenitors and intermediates (dark green) at steps indicated: before NE selection (pre- 
selection) and during NE selection (class 1), migration to form clusters (classes 2-4), and differentiation (class 5). Light green cells, maturing NE cell already 
at target (asterisk); white cells, NE cells before selection and other non-NE epithelial cells. Symbols show high (++), intermediate (+), low (+/-), no (-), or mixed 
(+ and -) expression. For polarity markers, expression refers to expression restricted to basal (bas), basolateral (bas-lat), or apical domains. See also Figure S7. 
(B-E) Confocal micrographs showing dynamic patterns of indicated NE (B), adhesion (C), polarity (D), and EMT (E) markers at indicated steps of NEB formation. 
Dots, outline of NEB; dashed line, basement membrane. (B) Ascii (green) is expressed during migration (left panels; E1 3) but downregulated during differentiation 
(right panels; E18), whereas CGRP (magenta) is detected only during differentiation. (C) E-cadherin is expressed around entire plasma membrane during NE 
selection (Ascii -positive (green) cell at E13; left panels), but is downregulated in class 3 and 4 intermediates (some of which are marked by ZsGreen in E15 
mouse induced with tamoxifen at E12.5) in developing NEB (dotted outline; center panels). E-cadherin is again detected in NE cells 
(CGRP-positive; green) of mature NEB (PN 2 month; right panels), although expression is excluded from apical cell surface and detected along lateral boundaries 
in at least some cells. Cells neighboring NEB (not shown) express higher levels of E-cadherin and have larger apical surface that excludes E-cadherin. Lam, 
laminin y1 (magenta). (D) Before NE selection (E12; left panels) epithelial progenitors are uniformly polarized and tight junction protein ZO-1 (magenta) is detected 
at apical surface of each cell. ZO-1 is downregulated and undetected in intermediates mouse labeled as above) during migration 

(center panels), when surrounding epithelial cells maintain its expression (inset), but is again detected during differentiation (right panels). (E) Snail proteins 
(magenta) are detected in some (filled arrowhead) but not other (open arrowhead) early NE progenitors (Ascii -positive, green) during NE selection (E13; left 
panels), but all intermediates express Snail during migration (middle panels). Intermediates downregulate Snail in concert with Ascii as clusters form and 
differentiate (E1 5; right panels), as seen by declining level of Snail and Ascii in some cells (asterisks) within maturing cluster. Snail proteins are induced again later 
in NE cells (C.S.K., M.A.K., and J. Ouadah, unpublished data). Scale bar (B-F), 10 |im. 



surrounding non-NE epithelial cells. Immunostaining also 
showed induction of neural cell adhesion molecule (NCAM) dur- 
ing NEB formation. NCAM was expressed in class 5 intermedi- 
ates (Figures 7A and S7B), implying a role in NEB assembly or 
cohesion. 

Immunostaining for epithelial apicobasal polarity markers in- 
tegrin p1 , laminin y1 , and zona occludens-1 (ZO-1), and analysis 
of the apicobasal distribution of E-cadherin, showed that devel- 
oping NE cells also undergo dynamic changes in polarity. Progen- 
itors prior to NE cell selection (Figures 7D and S7A), and epithelial 
cells not selected (Figures S7C and S7D), show the expected api- 



cobasal distribution of markers (Figure 7A). However, early NE 
progenitors (class 1 and 2) lose integrin p1 and ZO-1 expression, 
and E-cadherin is distributed around the entire plasma mem- 
brane; the only apparent residual polarity is contact with the lam- 
inin y1 -expressing basal lamina (Figure 7A). Migrating class 3 
intermediates appear to have lost epithelial structure and polarity, 
with E-cadherin expression diminished as noted above and no 
contact with the basal lamina or only residual contact in local 
areas of low or undetectable laminin y1 (Figures 70, 7D, and 
S7A). Olass 4 intermediates begin to reestablish epithelial struc- 
ture with E-cadherin again detected surrounding the entire cell 
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and their thin cytoplasmic extensions converging and contact- 
ing laminin in the basal lamina, suggesting re-initiation of apico- 
basal polarity (Figures 7A and 7D). Class 5 intermediates show 
continued maturation of epithelial structure and polarity that is 
complete in at least some cells, as evidenced by an expanded 
site of contact with laminin and basal lamina, expression of integ- 
rin p1 (Figure S7A) along the entire basolateral surface, and 
expression of ZO-1 at the apical plasma membrane and exclusion 
of E-cadherin from this domain (Figures 7A and 7D). 

These morphological and molecular changes show that NE 
progenitors undergo a transient epithelial-to-mesenchymal tran- 
sition (EMT) during migration and NEB formation. Many EMT 
events are controlled by Snail proteins, zinc finger transcription 
factors that drive the switch (Mani et al., 2008; Strobl-Mazzulla 
and Bronner, 201 2). Using an anti-Snail antiserum to detect Snail 
1 , 2, and 3, none was detected in the earliest NE progenitors or 
any other epithelial cells (Figures 7A and 7E). However, soon af- 
ter initiation of AscH expression, nuclear Snail was detected and 
expression continued throughout cluster formation, before 
declining in class 5 intermediates as NE clusters form and re- 
establish epithelial structure and AscH levels decline. We 
conclude that early NE progenitors induce Snail expression 
and undergo EMT, although it is an unconventional EMT as pro- 
genitors remain associated with the epithelium and the process 
soon reverses as progenitors downregulate Snail and restore 
epithelial structure and polarity on reaching the target. 

DISCUSSION 

We have shown that NEBs form from NE progenitors initially 
distributed apparently randomly in the bronchial epithelium but 
then resolve in 2 days into clusters of 20-30 NE cells at stereo- 
typed, diametrically opposed positions at the base of each bron- 
chial branch. There is little or no progenitor proliferation during 
the process, so NEBs do not form by clonal expansion, as do 
Drosophila and other insect neurosensory organs. There is also 
little if any apoptosis of progenitors or reduction in progenitor 
number, so the clearing of progenitors from regions surrounding 
developing NEBs is not due to programmed cell death. Instead, 
lineage tracing, sparse labeling of progenitors, and high resolu- 
tion imaging of intermediates in vivo, together with live imaging 
of progenitors in lung slices ex vivo, show that progenitors lose 
epithelial character and adopt fibroblast-like morphologies as 
they slowly (1-3 iim/hr for 1-2 days) crawl over and around 
neighboring epithelial cells, converging at diametrically opposed 
sites at the base of each bronchial branch and then rapidly 
(1-2 hr) joining the cluster. The intermediates express EMT tran- 
scription factor Snail and transiently downregulate E-cadherin 
and epithelial polarity proteins and then re-express these adhe- 
sion and polarity proteins and turn on NE differentiation genes 
and NCAM as they re-establish epithelial structure and assemble 
into a neurosensory organ. 

The results lead us to propose that NEBs form by a targeted 
mode of epithelial cell sorting we call “slithering,” in which the re- 
arranging cells transiently lose epithelial structure and polarity 
yet remain intimately associated with the epithelial sheet as 
they traverse neighboring epithelial cells and converge at the 
target site. This mechanism of cell rearrangement differs dramat- 



ically from intercalation, the classical mode of epithelial cell 
rearrangement, in which cells shrink lateral junctions with some 
neighboring cells while expanding junctions with others, 
exchanging positions while maintaining their polarized structure 
(Bertet et al., 2004; Blankenship et al., 2006; Guillot and Lecuit, 
2013). At a mechanistic level, slithering is more reminiscent of 
cell delamination by EMT (Lamouille et al., 2014), such as during 
cancer metastasis (Scheel and Weinberg, 201 2) or in neural crest 
formation where neural tube cells detach from the basement 
membrane and become mesenchyme-like cells that colonize 
distant sites (Strobl-Mazzulla and Bronner, 2012). During slither- 
ing, however, NE precursors never (or only transiently) leave the 
epithelial layer and they move comparatively short distances 
(tens of microns) and then rapidly regain epithelial character, 
so it is only a fleeting conversion. Epithelial cells can also un- 
dergo partial EMT during branching morphogenesis, in some 
organs forming highly dynamic, proliferative cell clusters at 
bud tips (Ewald et al., 2008), and individual proliferating epithelial 
cells can even occasionally send daughter cells to non-adjacent 
positions in the monolayer (“mitosis-associated cell dispersal”) 
(Packard et al., 2013). However, unlike these apparently random 
epithelial cell rearrangements and dispersals, slithering occurs 
without proliferation and is selective, directed, and purposeful. 
It results in active sorting of NE cells from surrounding epithelial 
cells in the monolayer, contrasting with the passive mechanisms 
long envisioned for cell sorting (Krens and Heisenberg, 2011; 
Steinberg, 1963) and forming neurosensory organs at defined 
positions. 

Two prominent features of slithering are its selectivity for NE 
cells and their specific targeting to diametrically opposed posi- 
tions at the base of each bronchial branch, raising the questions: 
what provides the guidance cue and its selectivity for NE cells? 
Because of the dynamic exploratory behavior of NE cells during 
slithering, we favor the idea that the guidance cue is a chemoat- 
tractant. Because only NE cells slither and they do so after 
turning on the master transcription factor AscH, we suggest 
that the chemoattractant receptor is specific for NE cells and 
part of their developmental program. However, there is no 
obvious cell or structure at target sites that could provide a point 
source of chemoattractant, and neurites that innervate NEBs 
arrive at their target only after the clusters form (Figures 2F and 
S3G-S3I). Perhaps the cue is a combination of more broadly 
distributed but overlapping signals, or even a physical signal 
related to the target structure: NEBs form at the most proximal 
and distal positions of the branch junction, where the connecting 
branches form the most obtuse (proximal) and acute (distal) 
angles. Although we have not systematically studied the forma- 
tion of mini-clusters, available data indicate that they, too, form 
by slithering. However, because they do not form at stereotyped 
positions, their clustering could be driven by a homotypic attrac- 
tion signal. A homotypic signal may also contribute to NEB for- 
mation, as real time imaging showed repeated transient contacts 
(“cell kisses”) among progenitors entering a NEB. A high priority 
now is to identify the signal(s) and their sources and receptors 
that control slithering and to determine if the same signals also 
guide outgrowth of neurites that target NEBs. 

Because of the similarity in their developmental origins and 
clustered structures, slithering may also be used to form other 
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epithelial neurosensory organs such as taste buds (Chai et al., 
2000; Okubo et al., 2009) and Merkel touch domes in skin 
that mediate light touch and spatial discrimination (Morrison 
et al., 2009; Van Keymeulen et al., 2009; Wright et al., 2015). 
Because slithering is difficult to detect without membrane 
marking or live imaging of individual intermediates, slithering 
could be a more widely used mechanism of movement and 
sorting of epithelial cells that has been overlooked even in 
well-studied epithelia such as the intestinal crypt, where Paneth 
cells move down from the proliferative zone as other cells 
move up (Batlle et al., 2002). Slithering appears to be a late 
evolutionary innovation, at least in the lung, because although 
all mammals studied have NEBs, the most primitive extant fish 
species have only solitary pulmonary NE cells (Zaccone et al., 
1989). 

It will be important to identify the slithering program and 
indeed the full program of NEB formation including cell selection, 
migration, differentiation, and innervation, and how these pro- 
cesses go awry in Notch pathway mutants (Morimoto et al., 
201 2; Tsao et al., 2009) and other forms of pulmonary NE cell dis- 
ease including those with excess or misplaced NE cells (Nassar 
et al., 2011; Young et al., 2011). One appealing idea is that the 
slithering program is transiently reactivated in their stem cell 
function as they move out of their niche to replace dying neigh- 
boring cells and permanently activated by mutation during onco- 
genesis. The latter could explain why small cell lung cancer 
arising from pulmonary NE cells metastasize early and are the 
most deadly form of lung cancer. 

EXPERIMENTAL PROCEDURES 
Animals 

CD-1 was wild-type strain. Cre alleles and reporters are described in the Sup- 
plemental Experimental Procedures. All animals were maintained and experi- 
ments performed in accordance with Stanford University’s lACUC-approved 
protocols. 

Immunohistochemistry and Histology 

Embryos and lungs from timed pregnancies, with noon of the day of vaginal 
plug detection designated E0.5, were dissected, fixed in paraformaldehyde 
(PFA), and immunostains of whole mount lungs performed as described 
(Metzger et al., 2008) except as noted (Supplemental Experimental Proce- 
dures), then imaged by confocal microscopy or optical projection tomography. 
For cryosections, tissue was fixed in PFA or Zamboni’s fixative, cryoprotected 
in 30% sucrose, frozen in OCT, and stored at -80°C. Frozen tissue blocks 
were sectioned and incubated sequentially with blocking solution, primary 
antibody, secondary antibody conjugated to Alexa fluorophores, and DAPI. 
For full description of antibodies and methods, see the Supplemental Experi- 
mental Procedures. 

Mapping Pulmonary NE Cells 

Serial sections of entire left lobes (n = 5) of E16 CD-1 mice were stained for 
E-cadherin and Ascii and branches L.L1, L.L2, L.L3, L.L4, L.D1, L.D2, L.D3, 
and L.D4 systematically examined for NE cells. Additional ages and branch 
points in left and right lung lobes were analyzed as described (Supplemental 
Experimental Procedures). 

Developmental Analysis of NE Cells 

Serial sections of entire left lung lobes (n = 5 for each stage examined) of E12- 
E16 CD-1 mice were stained for Ascii and E-cadherin, and positions of NE 
cells were determined in branches and at junctions indicated. For NE cell 
counts (Figure S3D), serial sections of the entire left lobe were immunostained 



for Ascii and E-cadherin and cells scored in each segment along left main 
bronchus as indicated. 

For EdU incorporation, 300 ^ig EdU was injected intraperitoneally and lungs 
harvested 2 hr later. EdU was detected by click chemistry in cryosections 
immunostained for Ascii and counterstained with DAPI, then visualized by 
confocal microscopy. For phospho-histone H3 analysis, cryosections were 
immunostained for Ascii and anti-phospho-Histone 3-Ser10. 

Lineage Tracing and Clonal Analysis 

Lineage tracing was done with Shh^''^^'^’Rosa26^^^^^^ mice (airway epithelial 
lineage) and with Wnt1 -Cre;Rosa26^^^''^^'^^^ (neural crest lineage). Lungs 
were harvested between E17.5- E18.5, fixed, cryosectioned, and immuno- 
stained as above. To test monoclonality of NEBs (Figure 3A), cryosectioned 
E^8 Shh^''^^'^-,Rosa26^^'^^'^ lungs, in which all airway epithelial cells are perma- 
nently labeled early in development with one of the three Rainbow fluorescent 
reporters, were immunostained with anti-CGRP and an Alexa633-conjugated 
secondary. Cell number and colors in each NE cluster were assessed by 
confocal fluorescence microscopy. For clonal analysis, individual NE progen- 
itors were sparsely labeled by tamoxifen injection at E1 1 .5 (4 mg tamoxifen) or 
E1 2.5 (3 or 4 mg) of Asc/ 7 mice, and lungs were harvested 
between E17.5-E18.5 and analyzed as above. 

Labeling NE Developmental Intermediates 

mice were induced with tamoxifen at E12.5, and 
lungs were harvested at E1 4.5 or E1 5.5 and serially sectioned, then co-stained 
for PGP9.5, E-cadherin and, for some samples, laminin gammal (y1). NEBs 
and mini-clusters were analyzed by confocal microscopy, and a 35-40 ^im 
z stack was collected and each labeled NE progenitor examined at high reso- 
lution. Three-dimensional reconstructions (Velocity) of each progenitor were 
used to identify cell shape, cytoplasmic processes, and interactions with other 
cells and basement membrane. Detailed descriptions of the morphologic 
classes are given in the Supplemental Experimental Procedures. 

Live Imaging of NE Cells in Slice Culture 

mice were induced with tamoxifen by oral 
gavage at E13 to label pulmonary NE progenitors with ZsGreen and mGFP 
and all other cells with TdTomato. At E15, lungs were harvested and left lobes 
and right caudal lobes separated, embedded in agarose and sectioned (Com- 
presstome). Individual slices (175-|im thick) were transferred to a coverglass 
chamber, covered with Matrigel, and cultured in DMEM +F12 medium with 
10% fetal bovine serum in an environmental chamber at 37°C and 5% CO 2 . 
After 2-3 hr to establish the culture, confocal images were collected every 
20 min for 60 hr. See the Supplemental Experimental Procedures. 

SUPPLEMENTAL INFORMATION 

Supplemental Information includes Supplemental Experimental Procedures, 
seven figures, three tables, and two movies and can be found with this article 
online at http://dx.d 0 i. 0 rg/l 0.1 01 6/j.cell.201 5.09.021 . 
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SUMMARY 

Message-specific translational control is required for 
gametogenesis. In yeast, the RNA-binding protein 
Rim4 mediates translational repression of numerous 
mRNAs, including the B-type cyclin CLB3, which is 
essential for establishing the meiotic chromosome 
segregation pattern. Here, we show that Rim4 forms 
amyloid-like aggregates and that it is the amyloid- 
like form of Rim4 that is the active, translationally 
repressive form of the protein. Our data further 
show that Rim4 aggregation is a developmentally 
regulated process. Starvation induces the conver- 
sion of monomeric Rim4 into amyloid-like aggre- 
gates, thereby activating the protein to bring about 
repression of translation. At the onset of meiosis II, 
Rim4 aggregates are abruptly degraded allowing 
translation to commence. Although amyloids are 
best known for their role in the etiology of diseases 
such as Alzheimer’s, Parkinson’s, and diabetes by 
forming toxic protein aggregates, our findings show 
that cells can utilize amyloid-like protein aggregates 
to function as central regulators of gametogenesis. 



INTRODUCTION 

Production of haploid gametes from diploid progenitor cells is a 
hallmark of sexual reproduction. The process is mediated by a 
specialized cell division, meiosis, where two chromosome divi- 
sions, meiosis I and II, follow a single S phase. While meiosis II 
resembles mitosis in that sister chromatids are separated, 
meiosis I is unusual in that homologous chromosomes rather 
than sister chromatids are segregated. Message-specific trans- 
lational control of cyclins is required for progression through the 
meiotic divisions (reviewed in Kronja and Orr-Weaver, 2011). In 
budding yeast, translational control of the B-type cyclin CLB3 
is essential for establishing the unusual meiosis I division. 
CLB3 mRNA is translationally repressed during meiosis I but is 
translated during meiosis II (Carlile and Amon, 2008). When 
CLB3 is inappropriately expressed during meiosis I, meiosis I 
chromosome segregation is suppressed. Instead, cells undergo 



a mitosis-like division illustrating the importance of translational 
control for gamete formation (Miller et al., 2012). 

During meiosis I, the RNA-binding protein Rim4 represses the 
translation of CLB3 and numerous other mRNAs, the protein 
products of which are required during the late stages of gameto- 
genesis (Berchowitz et al., 2013). Rim4 binds to the 5'UTR 
of CLB3 mRNA and represses translation until the onset of 
meiosis II. At this stage of meiosis, Rim4 is abruptly degraded. 
This degradation is controlled by the meiosis-specific protein 
kinase Ime2 (Berchowitz et al., 2013). These findings made it 
clear that understanding how Rim4 represses translation is 
essential to understanding how meiosis and gametogenesis 
are governed. Here, we shed light on this question. We show 
that Rim4 forms aggregates to repress translation. Rim4 aggre- 
gates are SDS-resistant, bind biotinylated isoxazole (b-isox), 
and can be pelleted and sedimented from detergent-containing 
lysate by ultracentrifugation. In vitro, Rim4 forms fibrillar 
structures containing p sheet. These biochemical properties 
are indicative of fiber-forming leading us to define Rim4 as 
“amyloid-like.” 

Amyloids and amyloid-like aggregates have been implicated 
in processes such as hormone storage (Maji et al., 2009) and 
melanin production in mammals (Fowler et al., 2006) and hetero- 
karyon formation in filamentous fungi (Dos Reis et al., 2002). In 
Drosophila, the Orb2 protein forms oligomers with amyloid-like 
properties that play a critical role in long-term memory persis- 
tence (Majumdar et al., 2012). In mammals, RIP1/RIP3 protein 
kinases form a heterotypic amyloid complex that is critical for 
programmed cell necrosis (Li et al., 2012). 

While some biological functions have been ascribed to amy- 
loids, these aggregates are predominantly understood in the 
context of human diseases such as Alzheimer’s, Parkinson’s, 
amyotrophic lateral sclerosis (ALS), diabetes, and all known 
prion diseases (reviewed in Knowles et al., 2014). Since their 
discovery over 150 years ago (Sipe and Cohen, 2000), the 
main focus of amyloid research has been on understanding the 
proteins’ disease-causing properties. In most, if not all, diseases 
associated with amyloids, the aggregates have been proposed 
to function in a dominant negative manner, sequestering mono- 
meric species into the aggregates causing their depletion 
within cells. In addition to depleting functional proteins, amyloids 
are thought to interfere with cellular functions (Thomas et al., 
1996), cause cell death (Loo et al., 1993), and in cases where 
they are secreted, induce inflammation (Akiyama et al., 2000). 
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Additionally, accumulating evidence suggests that soluble 
oligomeric amyloid precursors are critical drivers of neurotoxicity 
and are often described as inherently toxic (Bucciantini et al., 
2002; Walsh and Selkoe, 2007). Here, we show that proteins 
with amyloid-like properties can control gene expression. We 
demonstrate that the translational repressor Rim4 forms large 
assemblies with amyloid-like biochemical properties and that it 
is the aggregated form of Rim4 that is the active, translationally 
repressive form of the protein. Remarkably, assembly of these 
structures is developmentally regulated. Upon initiation of 
sporulation, one of the inducers of the meiotic fate, starvation, 
induces the switch from monomeric Rim4 into amyloid-like 
Rim4 rendering the protein competent to repress CLB3 transla- 
tion. At the onset of meiosis II, translational repression of CLB3 
is lifted through the degradation of Rim4 aggregates. Our results 
demonstrate that regulated formation of amyloid-like aggregates 
establishes and upholds gametogenesis. 

RESULTS 

Rim4 Is Efficiently Precipitated from Meiotic Lysate 
by B-lsox 

To begin to understand how Rim4 represses translation we used 
computational approaches to identify potential functional do- 
mains within the protein. Rim4 contains three N-terminal RNA- 
recognition motifs (RRMs) that are required for RNA binding 
and translational repression (Berchowitz et al., 201 3). The protein 
also contains a C-terminal low-complexity (LC) region (Figure 1 A) 
and harbors two poly-N stretches and a computationally pre- 
dicted prion domain (Alberti et al., 2009; Soushko and Mitchell, 
2000). Several RNA-binding proteins containing LC sequences 
have the ability to form amyloid-like fibrous aggregates. Addi- 
tionally, LC sequences/prion domains are important for these 
proteins to be recruited into nonmembrane-bound intracellular 
compartments such as stress granules (Gilks et al., 2004) that 
have been shown to function in translational control (Kedersha 
et al., 1999). 

Many fiber-forming RNA-binding proteins can be precipitated 
from aqueous solution or cell extracts by binding to the com- 
pound b-isox via their LC sequences (Kato et al., 201 2). To deter- 
mine whether Rim4 exhibits this property and thus shares 
biochemical similarity with other fiber-forming RNA-binding pro- 
teins, we asked whether Rim4 binds to b-isox. In order to obtain 
a uniform population of cells in which Rim4 inhibits translation, 
we induced cells lacking the transcription factor Ndt80 to 
undergo sporulation. Cells lacking Ndt80 progress through the 
early stages of the meiotic cell cycle efficiently but arrest in G2 
prior to meiosis I entry (Xu et al., 1995). In this cell-cycle stage, 
Rim4 is active and translation of CLB3 is inhibited (Berchowitz 
et al., 2013). We found that b-isox efficiently precipitated Rim4 
from lysates prepared from cells lacking NdtSO (Figure 1B). 

To determine more precisely when during meiosis b-isox was 
able to precipitate Rim4, we examined cells induced to progress 
through a synchronous meiosis using the NDT80 block-release 
system. In this system, NDT80 expression is driven from the 
GAL1-10 promoter (pGAL-NDT80) that in turn is controlled 
by an estradiol-regulatable Gal4-ER fusion (GAL4.ER). pGAL- 
NDT80 cells will arrest in meiotic G2 in sporulation medium 



lacking estradiol but will progress synchronously through the 
meiotic divisions upon estradiol addition (Benjamin et al., 2003; 
Carlile and Amon, 2008). Rim4 was efficiently precipitated by 
b-isox for 2 hr following the release from the G2 block, when cells 
were in meiosis I (Figures 1C and 1D). As shown previously 
(Berchowitz et al., 2013), when cells enter meiosis II, Rim4 was 
degraded (Figures 1C and 1D). 

To assess the specificity of b-isox, we used quantitative mass 
spectrometry to query the yeast proteome for meiotic b-isox 
binding proteins. We released cells from the NDT80 block and 
collected samples spanning the two meiotic divisions (Figure S1 ). 
B-isox was remarkably selective. Rim4 was the most enriched 
protein during early meiosis I (1 hr after release from the G2 
block; Figures 1 E and 1 F) and only three other proteins, among 
them the prion Sup35, exhibited significant enrichment at this 
time (Figure 1E; Table S1). We conclude that Rim4, like prions 
and other proteins with amyloid-like properties, binds to b-isox 
and does so during the stages of meiosis when the protein 
actively inhibits CLB3 translation. 

Rim4 Forms Amyloid-like Aggregates In Vitro 

Yeast prions and other amyloidogenic proteins such as Hunting- 
tin form amyloid aggregates via regions within the protein rich in 
N and Q residues (DePace et al., 1 998; Scherzinger et al., 1 997). 
Given that Sup35 and Rim4 were efficiently precipitated by 
b-isox and that Rim4 contains poly-N stretches, we next 
asked whether Rim4, like Sup35, forms amyloid-like aggregates 
in vitro. We generated highly purified recombinant Rim4 harbor- 
ing minimal nucleic acid contamination (Figures S2A-S2C) 
and analyzed concentrated protein by transmission electron 
microscopy (TEM). At a concentration of 40 mg/ml, Rim4 formed 
fibrillar helical structures with a fixed width of ~1 5 nm and length 
up to 1 5 |im visible by both negative stain and cryoelectron TEM 
(cryo-TEM) (Figure 2A). The ability to form fibers depended 
on the protein’s C-terminal LC sequences. A mutant derivative 
lacking the C-terminal 294 amino acids but leaving the three 
RRMs intact (Rim4A294C) did not form long helical fibers (Fig- 
ure 2B). Because Rim4 is an RNA-binding protein (Berchowitz 
et al., 2013), we examined the effects of RNA on Rim4 fiber 
formation. Incubation of non-fibrous Rim4 with a bona fide 
RNA target, the CLB3 5'UTR, or total RNA did not induce fiber 
formation as assessed by TEM (Figure S2D). Furthermore, 
RNase treatment of pre-formed fibers did not result in fiber 
disassembly (Figure S2E). These results indicate that fiber for- 
mation in vitro can occur in the absence of RNA. 

Biochemical analyses further demonstrated that Rim4 fibers 
are composed of p sheets, a hallmark of amyloid. Circular 
dichroism (CD) spectrum analysis of fibrous Rim4 revealed a 
minimum at 21 8 nm (Figure S2F), which is characteristic of struc- 
tures rich in p sheet. In contrast, the CD spectrum of Rim4A294C 
was predominated by a-helical signature (Figure S2F). Further- 
more, Rim4 but not Histone H1 bound to Thioflavin T (ThT), 
a dye that stains the p sheet-rich structures of amyloid and 
not monomers (Nilsson, 2004) (Figure 2C). Rim4 lacking the 
C-terminal LC sequences also bound Thioflavin T but with 
reduced affinity compared to full-length protein (Figure 2C). We 
speculate that the N-terminal disordered sequence (Figure 1A) 
mediates the production of some p sheet structures in the 
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Figure 1. B-lsox Binds Rim4 in Meiosis I 

(A) Diagram of Rim4. 

(B) RIM4-3V5, pGAL-NDT80, and GAL4.ER (A30868) cells were induced to sporulate at 30°C. After 6 hr when cells had arrested in G2 due to the lack of Ndt80, 
lysates were prepared. Lysates were incubated with 1 00 pM b-isox (or DMSO control) and Rim4 abundance was examined by western blot analysis. Pgk1 did not 
bind b-isox. 

(C) Cells were grown as in (B) except cells were released from the G2 block by the addition of 1 pM p-estradiol. Lysates were prepared from G2-arrested cells (0 hr) 
and at 1 , 2, and 3 hr post-release and analyzed as in (B). 

(D) Meiotic staging data for (C). The percentage of metaphase I, anaphase I, metaphase II, and anaphase II cells was determined. 

(E) pGAL-NDT80 (A15055) strains carrying the GAL4.ER were grown and lysates prepared as in (C). Precipitated proteins were ITRAQ-labeled and analyzed by 
mass spectrometry. Shown are the ratios of b-isox enrichment/DMSO for each identified protein converted to Z scores. 

(F) Meiotic staging data from (E). 

See also Figure SI for detailed workflow and Tables SI and S2. 

truncated protein. In concordance with these results we recognized to a much lesser extent and Histone H1 not at all. 
observed that Rim4 is recognized by an a-amyloid fibril antibody We note that unlike other LC-sequence containing proteins 

(a-amyloid fibrils OC, Millipore) (Figure 2D). Rim4A294C is (Kato et al., 2012), Rim4 did not form hydrogels, even when 
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Figure 2. Rim4 Forms Amyloid-like Aggre- 
gates In Vitro 

(A and B) Transmission eiectron microscopy (TEM) 
of fibers formed by recombinant Rim4 and 
Rim4A294C (aa 1-420) incubated at 4°C overnight. 
The top three paneis are micrographs depicting 
negativeiy stained sampies (prepared at 40 mg/mi 
and 1 mg/mi). The bottom ieft paneis depict Rim4 
sampies anaiyzed by cryo-EM at concentrations of 
40 mg/mi (ieft) and 4 mg/mi (right). Scaie bars are 
shown in white. 

(C) Recombinant Rim4 and Rim4A294C (various 
concentrations diiuted to 1 mg/mi just before 
assay) were incubated with 16 ng/mi Thioflavin T, 
and the fluorescence intensity was measured as in 
Nilsson (2004). Histone H1 at 4 and 40 mg/ml was 
used as a control. 

(D) Recombinant protein (1 0 ^ig) was slot-blotted 
onto nitrocellulose and assayed for reactivity 
with a-amyloid fibril OC. Rim4, Rim4A294C, and 
Histone H1 concentrated to 40 mg/ml were as- 
sayed. Amyloid beta (A(3, Anaspec) concentrated 
to 0.25 mg/ml was used as a positive control. 

See also Figure S2 and Table S2. 



incubated at 4°C for months at high concentrations. Neverthe- 
less, our analyses demonstrate that recombinant Rim4 forms ag- 
gregates with several morphological and biochemical properties 
of amyloid in the absence of other proteins or nucleic acids and 
that the C-terminal LC sequences are critical for fiber formation. 

Evidence that Rim4 Forms Aggregates In Vivo 

Amyloid-like aggregates can form stable structures in vivo as 
judged by fluorescence recovery after photobleaching (FRAP) 
(Kayatekin et al., 2014). For example, when the amyloidogenic 
Fluntingtin protein harboring a greatly expanded poly-Q stretch 
(Htt103Q) is expressed in yeast, aggregated foci display very 
little signal recovery after photobleaching whereas non-aggre- 
gating Huntingtin proteins exhibit rapid signal recovery (Kayate- 
kin et al., 2014). To determine whether Rim4 forms aggregates 
in vivo, we analyzed the FRAP kinetics of wild-type and aggre- 
gate-defective Rim4 that lacks the C-terminal 204 amino acids 
(Rim4A204C) in sporulating cells (Figures 3A and 3B). After pho- 
tobleaching, wild-type Rim4 showed minimal recovery while the 
Rim4A204C signal recovered with similar kinetics as the soluble 
cytoplasmic enzyme Pgk1 . This result demonstrates that Rim4 
structures within the cell are largely static. This rigidity depends 
on Rim4’s C-terminal 204 amino acids. 

The localization pattern of Rim4 is also consistent with the 
protein forming aggregates in cells during stages of meiosis 
when the protein inhibits translation. In meiotic G2-arrested 
cells, Rim4 protein levels were similar between wild-type and 
Rim4A204C as judged by signal intensity within a fixed area of 
the cell (Figures 3C and 3D). However, whereas the Rim4A204C 
signal was uniformly distributed throughout the cytoplasm, wild- 
type Rim4 appeared more focal, leading to a more granular 
staining pattern (Figure 3C). This was most evident when we 
analyzed heterogeneity in pixel intensity by comparing the 
SDs of the mean pixel signal intensity within a defined area 



of cytoplasm. The SD was significantly higher for wild-type 
Rim4 than the aggregate-defective Rim4A204C mutant protein 
(Figure 3E). 

To determine whether the C-terminal LC sequences of Rim4 
affect Rim4 clearance, we compared protein levels of wild-type 
Rim4 and the Rim4A204C mutant in live cells undergoing meiosis. 
To ensure that the Rim4 mutant protein did not affect meiotic 
progression, we conducted the experiment in diploid cells that 
carried one wild-type copy of RIM4 and either one EGFP-tagged 
wild-type RIM4 or rinn4L204C allele. Rim4 accumulated during 
premeiotic S phase and persisted throughout meiosis I at which 
point Rim4 was abruptly degraded (Figures 3F, S3A, and S3B; 
Movies S1 , S2, S3, and S4). Surprisingly, Rim4A204C was not 
cleared from cells efficiently compared to Rim4-EGFP (Figures 
3F and S3C; Movies S5 and S6). The mutant protein persisted 
in cells well beyond exit from meiosis I. This result indicates that 
the C-terminal 204 amino acids are important for Rim4 aggrega- 
tion in vivo and contain elements necessary for proper Rim4 
clearance at the onset of meiosis II. 

Rim4 Forms SDS-Resistant Aggregates during Meiosis 

To further examine whether Rim4 indeed forms amyloid-like 
aggregates in vivo, we employed semi-denaturing detergent 
agarose gel electrophoresis (SDD-AGE). Amyloid (as opposed 
to amorphous or globular) aggregates are SDS-resistant and 
SDD-AGE allows for the resolution of these structures (Alberti 
et al., 2009). Rim4 formed SDS-resistant aggregates in cells pro- 
gressing through meiosis in a synchronous manner (Figure 4A). 
Previously described fiber-forming (hydrogelling) RNA-binding 
proteins do not form SDS-resistant structures (Kato et al., 
2012). Thus, Rim4 forms different structures from the fiberizing 
RNA-binding proteins that form hydrogels. However, we cannot 
rule out that Rim4 aggregates adopt a higher-order static hydro- 
gel state within the cell. 
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Figure 3. Evidence that Rim4 Forms Aggregates in Cells 

(A and B) Rim4 forms static aggregates in cells. Strains heterozygous for HTB1-mCherry and homozygous for either RIM4-EGFP (A36245) or rim4/1204C-EGFP 
(A36706) were induced to sporulate at 30°C. The mobility of Rim4 aggregates was determined by FRAP of live cells. Cells were imaged starting at 5 hr after 
transfer into sporulation medium. A haploid mitotic culture expressing Pgk1 -GFP (A34422) was used as a soluble control. Bleached areas were normalized to an 
unbleached control area. Time point zero was the first observed time point after bleaching. Representative cells are shown in (A) with Rim4 in green and the 
nucleus in red (scale bar, 5 pm). Signal recovery over time is shown in (B). Plotted is the mean and SEM of ten cells per strain. 

(C-E) pGAL-NDT80, GAL4.EP cells harboring PIM4-mCherry (A35632), or rim4A.204C-mCherry (A37237) were induced to sporulate at 30°C. After 6 hr when cells 
had arrested in G2 due to the lack of NdtSO, the cells were fixed and imaged. (C) Single-plane images of representative cells arrested in G2 are shown. Scale bar, 
5 pm. (D and E) Rim4-mCherry signal (vacuole excluded) was quantified in the central z plane. The mean Rim4 signal is not significantly different between the two 
strains, however, the SD (aggregation proxy) is significantly different, n = 50 cells per strain. 

(F) Cells either harboring a PIM4-EGFP (A36775) or rim4A204C-EGFP (A36773) fusion were induced to sporulate at 30°C. After 1 hr of growth in batch culture, 
the cells were loaded onto a microfluidics chip and imaged every 20 min. A quantification of Rim4-EGFP signal over time is shown. Plots were aligned according 
to the first meiotic division. Cells harboring PIM4-EGFP and rim4/1204C-EGFP are shown in black and red, respectively. Plotted is the mean and SEM often cells 
per strain. Scale bar, 5 pm. 

See also Figure S3, Table S2, and Movies SI , S2, S3, S4, S5, and S6. 
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Figure 4. Rim4 Forms Amyloid-like Aggre- 
gates In Vivo to Repress Translation 

(A) Rim4 amyloid-like aggregates form during 
premeiotic S phase that are cleared during 
meiosis II. pGAL-NDT80 cells (A30868) were 
induced to sporulate and Rim4’s ability to form 
SDS-resistant aggregates was analyzed as in 
Alberti et al. (2009). Samples were incubated in 2% 
SDS for 10 min and resolved on 1.7% agarose 
(with 0.1% SDS), and Rim4 was detected by 
immunoblot analysis. A strain harboring Rim4 
lacking its C-terminal LC sequences (A33848) is 
shown as a control. 

(B and C) Western blot analysis of time course 
shown in (A). Total levels of Rim4-3V5, Clb3-3HA, 
and Pgk1 (loading control) are shown in (B). 
Quantification of blots is shown in (C). Rim4 SDS- 
resistant aggregates were quantified from SDD- 
AGE in (A) and Clb3 from SDS-PAGE in (B). Total 
levels of Rim4 and Clb3 were normalized to Pgk1 . 
(D-F) CLB3 translational repression begins be- 
tween 2 and 3 hr after induction of sporula- 
tion (see Figure S4A for detailed workflow). A 
culture harboring RIM-3V5, GAL4.ER, and pGAL- 
5' UTRclb3~CLB3 (A35430) was split and induced 
to sporulate at 30°C. pGAL-5' UTRclb3~CLB3 was 
induced by p-estradiol at the indicated time. Rim4- 
3V5, Clb3-3HA, and Pgk1 (loading control) protein 
and CLB3 mRNA and rRNA (loading control) levels 
were determined at the indicated times following 
p-estradiol addition (D). SDS-resistant Rim4 ag- 
gregates were analyzed in (E). A quantification of 
Rim4 aggregate formation (open circles) and Clb3 
translation (Clb3 protein/C/_B3 mRNA after 60 min 
induction: closed circles) is shown in (F). 

See also Table S2. 





Rim4 aggregates formed as soon as the protein was produced 
during premeiotic S phase and persisted throughout meiosis I. 
During meiosis II, SDS-resistant aggregates were no longer 
detected. We note that while SDS-resistant Rim4 aggregates 
were absent during meiosis II, we did detect Rim4 protein in 
extracts prepared by boiling cells in SDS-PAGE buffer (compare 
Figures 4A and 4B). We can envision two not mutually exclusive 
explanations for this observation. First, incomplete synchrony 
of cells progressing through meiosis could be responsible. The 
Rim4 protein detected in meiosis II time points by SDS-PAGE 



could originate from cells that either did 
not enter sporulation or arrest at some 
point in meiosis prior to Rim4 aggregate 
formation. Second, Rim4 aggregates 
could be disassembled in meiosis II 
before they are degraded. We note that 
monomeric Rim4 is not detected in 
SDD-AGE in these late meiotic time 
points, but previous studies indicated 
that SDD-AGE does not reliably detect 
monomeric proteins (Halfmann et al., 
2012). It is also possible that monomeric 
Rim4 is unstable (indeed the protein is 
degraded during meiosis II) and hence 
degraded during the lengthy SDD-AGE extract preparation 
procedure. Irrespective of whether or not Rim4 aggregates are 
disassembled prior to their degradation, the presence of amy- 
loid-like Rim4 aggregates in cells correlated with Rim4-mediated 
translational repression. Clb3 protein levels accumulated as 
Rim4 aggregates were degraded (Figures 4A-4C). This finding 
raises the interesting possibility that it is the amyloid-like form 
of Rim4 that mediates translational repression. 

To further explore whether translational repression of CLB3 
correlates with Rim4 aggregation, we utilized a construct where 
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CLB3 expression can be induced at will from the GAL1-10 pro- 
moter (driven by a Gal4-ER fusion) but that preserves transla- 
tional control of CLB3 (henceforth pGAL-5' UTRclb3-CLB3). If 
Rim4 amyloid-like aggregation is important for its function, 
CLB3 translational repression should correlate with aggregate 
formation. To test this prediction, we assessed when during early 
stages of sporulation CLB3 becomes translationally repressed. 
We induced expression of CLB3 from the pGAL-5'UTRclb3- 
CLB3 fusion at various times after the induction of sporulation 
in cells lacking NDT80 that progressed through sporulation in 
a synchronous manner (for a detailed description of the experi- 
mental setup see Figure S4A). CLB3 translation occurred effi- 
ciently within the first 2 hr after induction of sporulation. By 
3 hr, however, when cells had entered pre-meiotic S phase, 
CLB3 translation was repressed as judged by the lack of Clb3 
protein despite ample induction of CLB3 expression (Figures 
4D and S4B). Translational repression persisted until the end of 
the experiment when cells had arrested in G2 due to the lack 
of NDT80. Loss of CLB3 translation correlated remarkably well 
with the formation of SDS-resistant Rim4 aggregates (Figures 
4E and 4F). Taken together, these data indicate that cells 
generate RNA-binding amyloid-like aggregates of Rim4 to 
mediate translational control of gene expression in vivo. 

The LC Sequences of Rim4 Mediate Formation of 
Amyloid-like Aggregates and Translational Repression 

To determine whether the formation of amyloid-like Rim4 
aggregates was required for translational repression of CLB3 
mRNA, we first identified the region within Rim4 responsible 
for aggregation in vivo. Two recent reports have shown that 
RNA stimulates formation of amyloid-like fibers of the RNA- 
binding protein FUS by increasing local concentration and 
thus nucleating assembly (Kato et al., 2012; Schwartz et al., 
2013). Does Rim4’s RNA-binding property mediate amyloid- 
like aggregation? To address this question, we analyzed 
SDS-resistant aggregate formation in a rim4-F139L mutant. 
The riiri4-F139L mutant encodes a protein in which a critical 
phenylalanine residue in the first RRM domain is mutated to 
leucine (Soushko and Mitchell, 2000). The RRM mutant formed 
amyloid-like aggregates equally well as wild-type Rim4 in vitro 
and in vivo. Recombinant Rim4-F139L purified and concen- 
trated to 27 mg/ml forms fibers in vitro (Figure 5A) and SDS- 
resistant aggregates in vivo (Figure 5B). We conclude that 
RNA-binding is not a prerequisite for aggregation in vitro and 
in vivo. 

The LC sequences of Rim4 are a prime candidate for medi- 
ating the formation of Rim4 amyloid-like aggregates. Indeed, 
deleting the C-terminal 271 amino acids that comprise the LC 
sequences (Figure 5C) abrogated the formation of SDS-resistant 
aggregates in vivo (Figure 4A). As an alternative method to 
assess the importance of the C-terminal region of Rim4 in 
aggregate formation, we employed differential centrifugation. 
The unusual detergent-insolubility of amyloid-like aggregates 
allows for the sedimentation of these particles by ultracentrifuga- 
tion from native lysates in the presence of detergent (Sond- 
heimer and Lindquist, 2000). We assayed the sedimentation 
properties of RIM4-3V5 as well as two riiri4 mutants lacking 
either most (riiri4/^271C-3V5) or a portion of the LC sequences 



(rim4M38C-3V5). We observed full-length Rim4 protein pre- 
dominantly in the pellet fraction while the aggregate-defective 
mutants remained in the supernatant (Figure 5D). We conclude 
that the C-terminal LC sequences of Rim4 are required for aggre- 
gation of the protein in vivo. 

If amyloid-like aggregates are required for translational con- 
trol, then abrogation of Rim4 aggregation should result in loss 
of CLB3 translational repression. To test this prediction, we 
examined the consequences of deleting increasingly larger por- 
tions of the C-terminal LC sequences of Rim4 on aggregate 
formation and CLB3 translation. We arrested cells in meiotic 
G2 by deleting NDT80 and then induced expression of CLB3 
from the aforementioned pGAL-5' UTRclb3-CLB3 fusion. In cells 
expressing full-length Rim4, CLB3 translation was inhibited 
(Figures 5C and 5E). However, deleting as few as the C-terminal 
66 amino acids led to CLB3 translation in G2-arrested cells (Fig- 
ures 5C and 5E). Larger deletions further increased translation 
efficiency. 

Expressing Clb3 in G2-arrested cells had consequences. 
Upon CLB3 induction, rim4 aggregation-defective cells inap- 
propriately formed meiosis I spindles in the G2 arrest, causing 
the appearance of cells with characteristics of both the ndt80A 
G2 arrest and meiosis I. Cells harbored separated spindle pole 
bodies (a meiosis I characteristic), and the synaptonemal com- 
plex protein Zip1 accumulated in the nucleus (a r?c/f80A arrest 
characteristic; Figure S5). Importantly, Rim4 truncations that 
failed to inhibit CLB3 translation also failed to form SDS-resis- 
tant aggregates (Figure 5F). These findings show that aggrega- 
tion and inhibition of translation are mediated by Rim4’s LC 
domain. 

Because the larger C-terminal deletions led to an ~50% 
decrease in Rim4 protein levels (Figure 5E), we also examined 
the effects of reducing the levels of full-length Rim4 by a similar 
degree. A RIM4/rim4/^ heterozygous strain produces approxi- 
mately equivalent amounts of Rim4 as a strain carrying two 
copies of a RIM4 allele producing a truncation lacking the 
C-terminal 138 aa (riiri4M38C-3V5) but translational repression 
was largely unaffected compared to rim4M38C-3V5 cells (Fig- 
ures 5E and S5). We conclude that the loss of translational 
repression observed in riiri4 aggregation-defective mutants 
is not due to decreased protein levels. Rather, we propose 
that it is caused by an inability to form high molecular weight 
aggregates. 

Rim4 Amyloid-like Aggregates Sequester CLB3 mRNA 

Aggregation-defective riiri4 mutants could abrogate transla- 
tional control because they fail to bind RNA, repress translation, 
or both. To determine whether amyloid-like aggregation of 
Rim4 is important for RNA-binding, we performed extract-bind- 
ing studies. We transcribed the CLB3 5'UTR along with two 
controls in vitro: the CLB3 5'UTR with a 25 nucleotide (nt) dele- 
tion and the 248 nt CLB1 5'UTR. These controls bind Rim4 
with decreased affinity compared to the CLB3 5'UTR (Bercho- 
witz et al., 2013). We coupled these RNAs via a biotin group to 
streptavidin beads and assayed the ability of the RNA to pull 
down Rim4 or an aggregation-defective mutant (Rim4A138C) 
from extracts prepared from G2-arrested cells, when Rim4 
inhibits translation. We found that the RNA-binding activity of 
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Figure 5. The LC Sequences, but Not RNA-Binding, Are Required for Rim4 Aggregation 

(A) TEM of fibers formed by Rim4-F139L (RRM mutant) concentrated to 27 mg/ml and incubated at 4°C overnight. 

(B) RNA binding is not required for amyloid-like aggregation. Rim4 was analyzed by SDD-AGE 6 hr after transfer into sporulation-inducing conditions of cells 
harboring rim4A271C-3V5 (A35408), a no-tag control (A15055), RIM4-3V5 (A35430), and rim4-F139L-3V5 (RRM mutant A35324). 

(C) Diagram of Rim4 C-terminal truncations. 

(D) Rim4 can be pelleted from detergent-containing undenatured lysate. Strains harboring RIM4-3V5 (A35430), rim4A138C-3V5 (A35402), and rim4A271C-3V5 
(A35408) were arrested in G2 as above. Native lysates were prepared and spun for 60 min at 250,000 x g. Rim4 levels were determined in total lysate, pellet, and 
supernatant. 

(E and F) rim4 mutants that fail to form amyloid-like aggregates are defective for translational control. Strains harboring RIM4-3V5 (A35430), heterozygous RIM4- 
3V5/rim4^ (A35841), and constructs shown in (C) (A35399, A35402, A35405, A35408) as well as pGAL-5'UTRclb3~CLB3, ndt80^, ZIP1-GFP, SPC42-mCherry, 
and GAL4.EP were arrested in G2 (6 hr) and pGAL-5' UTRclb3-GLB3 was induced by |3-estradiol addition. Rim4-3V5, Clb3-3HA, and Pgk1 (loading control) 
protein and CLB3 mRNA and rPNA (loading control) were determined (E). SDS-resistant Rim4 aggregates (at 6 hr) were analyzed in (F). The Sup35 aggregate- 
forming strain ([PS/+]; A25208) was used as a positive control. 

See also Figure S5 and Table S2. 



the aggregation-defective Rim4 was reduced by 50% (Figures 
6A and 6B). These results indicate that the ability of Rim4 to 
aggregate is important for its ability to bind RNA. However, the 
2-fold reduction in RNA-binding exhibited by Rim4A138C is un- 
likely to explain the >20-fold increase in translational efficiency of 



the mutant. We propose that while amyloid-like aggregation is 
required for efficient RNA-binding, Rim4 aggregates participate 
in translational control beyond their role in RNA-binding. 

Having determined that Rim4 aggregation plays only a minor 
role in RNA binding, we next tested the possibility that the 
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Figure 6. Effects of the LC Sequences on Rim4 RNA Binding and CLB3 mRNA Mobility in Sucrose Gradients 

(A and B) An aggregation-defective rim4 mutant has decreased RNA-binding activity. Extracts from RIM4-3V5 (A35430) and rim4M38C-3V5 (A35402) cells 
arrested in G2 (6 hr) were incubated with in vitro transcribed 3' biotinylated RNAs conjugated to streptavidin magnetic beads. Beads were recovered and boiled in 
SDS loading buffer to release bound factors. Shown is the amount of Rim4 recovered using the indicated RNA baits (A). Quantifications (B) were input-normalized 
and are shown as fold enrichment over the no-RNA control. 

(C and D) Rim4 and its bound mRNA targets exist in heavy mRNPs. Strains harboring RIM4-3V5 (A35430) and rim4M38C-3V5 (A35402) were arrested in G2 as 
above except CLB3 was expressed by the addition of (3-estradiol for 60 min prior to collection. Samples were fixed in 1 % formaldehyde on ice for 1 5 min (stopped 
by the addition of 0.1 M glycine) to preserve the RNPs (Valasek et al., 2007). Native lysates were fractionated on 10%-50% sucrose gradients. Sedimentation 
of Rim4, Pgk1 , as well as CLB3 mRNA and 18S rRNA was determined by immunoblot and qPCR, respectively. RNA quantification was normalized to luciferase 
RNA that was spiked into each fraction in equal amounts to normalize for RNA extraction efficiency. 

See also Table S2. 



Rim4 amyloid-like aggregates mediate repression of CLB3 
mRNA translation. If Rim4 aggregates repress translation, 
CLB3 mRNA should cofractionate with Rim4 aggregates but 
not Rim4 monomers or oligomers. Furthermore, in rim4 mutants 
that fail to form amyloid-like aggregates, CLB3 mRNA ought to 
fractionate with translating ribosomes. To test these predictions, 
we fractionated lysates from RIM4-3V5 and rim4M38C-3V5 
cells arrested in meiotic G2 (when CLB3 is translationally 
repressed) in 10%-50% sucrose gradients. This analysis 
showed that both Rim4 and CLB3 mRNA were present in the 
heaviest fractions of the gradient (Figures 6C and 6D). In 
contrast, Rim4A138C was predominantly present in the light 
fractions of the gradient and did not co-fractionate with CLB3 
mRNA. Instead, CLB3 mRNA was found enriched in heavy frac- 



tions of the gradient likely due to its association with translating 
ribosomes. Our analyses of wild-type and Rim4 mutant proteins 
in SDD-AGE as well as sedimentation and sucrose gradients 
indicate that it is the aggregated form of Rim4 that mediates 
translational repression. 

Gametogenesis Signals Regulate Rim4 Amyloid 
Formation 

Our results show that Rim4 forms amyloid-like aggregates 
during meiosis I to inhibit CLB3 translation. Is this process regu- 
lated, and if so, which aspect of the regulatory network that 
induces sporulation controls the aggregation of Rim4? Two sig- 
nals trigger gametogenesis in budding yeast: mating type and 
starvation (reviewed in van Werven and Amon, 2011). Only cells 
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Figure 7. Formation of Rim4 Amyloid-like Aggregates Is Regulated by Starvation 

(A and B) Starvation triggers Rim4 aggregation and transiationai repression (for detaiied workflow see Figure S6A). Haploid GAL4.ER, pGAL-5' UTRclb3~CLB3 
strains with or without pCUP1-RIM4-3V5 (A34980, A28492) were diluted to ODeoo = 1 .8 in either YEPD or SPO medium. pCL/P7-f?/M4 was induced by the addition 
of 25 lam CUSO4. pGAL-5' UTRclb3~CLB3 was induced by 2 |iM |3-estradiol prior to (0 hr) and after (6 hr) CUSO4 induction for 60 min. Rim4-3V5, Clb3-3HA, and 
Pgk1 (loading control) protein and CLB3 mRNA and rRNA (loading control) levels were determined at the indicated times in (A). SDS-resistant Rim4 aggregates 
(prior to and after CUSO4 induction) were analyzed by SDD-AGE in (B). 

(C) Monomeric Rim4 is converted into amyloid-like aggregates upon starvation (for detailed workflow see Figure S6B). A haploid pGAL-RIM4-3V5 strain was 
grown in BYTA medium in the presence of 1 |iM (3-estradiol for 3 hr to induce Rim4 expression. Cells were then shifted to SPO medium. Rim4 SDS-resistant 
aggregates were assayed. Rim4 and Pgk1 protein levels from TCA extracts were analyzed in Figure S6C. 

(legend continued on next page) 
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of the MATa/a mating type can undergo sporulation and they do 
so only when they are starved, that is when they are deprived of 
nitrogen and a fermentable carbon source. 

To determine whether formation of Rim4 amyloid-like aggre- 
gates is regulated by these gametogenesis-inducing signals, 
we asked whether ectopically expressed Rim4 aggregates in 
haploid cells grown in nutrient-rich medium supplied with ample 
nitrogen and glucose (YEPD medium). RIM4 expressed under 
the control of the copper-inducible CUP1 promoter readily pro- 
duced Rim4 protein in YEPD medium but the protein did not 
form amyloid-like aggregates as judged by the lack of SDS- 
resistant Rim4 particles (Figures 7A, 7B, and S6A). Accordingly, 
CLB3 was translated effectively (Figure 7A). In contrast, starva- 
tion caused Rim4 to form SDS-resistant aggregates. We starved 
haploid cells for 4 hr in sporulation medium and then induced 
Rim4 expression. Rim4 readily aggregated under these growth 
conditions and inhibited translation of CLB3 (Figures 7A and 
7B). These results indicate that the gametogenesis-inducing 
starvation signal causes the formation of Rim4 amyloid-like ag- 
gregates. Our data further demonstrate that monomeric Rim4 
cannot repress CLB3 translation. 

To directly test that starvation induces Rim4 aggregation, we 
expressed Rim4 in haploid cells grown in nutrient rich conditions 
(BYTA) when the protein is largely in its monomeric form. We 
then repressed RIM4 expression and simultaneously starved 
cells by transferring them into sporulation medium (Figures 7C 
and S6B). Within 30 min of inducing starvation, monomeric 
Rim4 converted into amyloid-like aggregates (Figures 7C and 
S6C). We conclude that aggregation of Rim4 is a regulated pro- 
cess. Monomeric Rim4 assembly into amyloid-like aggregates is 
induced by starvation signals to convert the protein into an am- 
yloid-like translational repressor. 

DAZL Forms SDS-Resistant Aggregates In Vivo 

Although budding yeast harbors many proteins with RRM do- 
mains and a prion domain, only Rim4 is sporulation-specific. In 
humans, there are only four such gametogenesis-specific pro- 
teins. They are DAZ1-DAZ4 (King et al., 2012). These four pro- 
teins exist as a cluster on the Y-chromosome- a region of the 
Y that is often found deleted in infertile men with a condition 
called azoospermia (Reijo et al., 1995). While DAZ exists only 
in humans and old world monkeys, one of its autosomal homo- 
logs DAZL (DAZ-WWe) is found from fish to humans (Haag, 
2001). Like Rim4, DAZL functions in the regulation of translation 
(Collier et al., 2005) and is essential for gametogenesis in many 
organisms, including mice (Ruggiu et al., 1997). We found that 
mouse DAZL forms SDS-resistant aggregates in cells undergo- 
ing gametogenesis (testes), but not in brain, intestine, and liver 
(Figures 7D and 7E). We propose that DAZL (and possibly 
DAZ) functions to regulate translation of gametogenesis-specific 
RNAs in a manner similar to Rim4 and suggest that amyloid-like 
aggregation of RNA-binding proteins is an evolutionarily 
conserved feature of gametogenesis. 



DISCUSSION 

Here, we show that amyloid-like aggregates, generally consid- 
ered to be inactive and often toxic, play a central role in creating 
the cell division essential for gamete formation and hence sexual 
reproduction. Rim4 forms amyloid-like aggregates to repress the 
translation of CLB3 during meiosis I that is critical to establish the 
meiosis I chromosome segregation pattern. Importantly, forma- 
tion of these structures is regulated. Starvation, one of the sig- 
nals that triggers gamete development, induces the conversion 
of monomeric Rim4 into aggregated Rim4. Thus, starvation not 
only initiates gametogenesis but also sets up the mechanism 
that brings about the meiotic chromosome segregation pattern 
much later during gamete formation. 

Rim4 Aggregation Is a Regulated Process 

Several lines of evidence demonstrate that Rim4 forms large am- 
yloid-like aggregates in vivo. The protein is immobile and found in 
granular structures within cells, forms SDS-resistant aggregates 
when isolated from cell extracts, can be sedimented and fraction- 
ated from undenatured detergent-containing lysates, and can be 
quantitatively precipitated from cell extracts using b-isox. The 
protein’s in vitro properties: Thioflavin T binding, fiber formation, 
CD spectrum, and reactivity with an anti-amyloid antibody further 
support the idea that Rim4 has amyloid-like properties. 

Our data lead to the conclusion that aggregation of Rim4 is a 
regulated process. When we express Rim4 in cells grown in 
nutrient-rich conditions the protein is monomeric, but upon 
transfer into sporulation-inducing conditions the protein quickly 
converts into the aggregated form. During normal progression 
through sporulation, we do not observe this conversion. This is 
not surprising because RIM4 is only induced during premeiotic 
DMA replication, after cells have been transferred into starva- 
tion-inducing medium. Thus, Rim4 is immediately assembled 
into amyloid-like aggregates upon its expression. We do not 
yet know how starvation induces Rim4 assembly. Downregula- 
tion of the RAS and TOR pathways is likely to play a central 
role in the process. Once we understand how amyloid assembly 
is controlled in vivo, we can begin to think about strategies to 
reverse the process. In time, these could very well result in the 
identification of anti-amyloid treatments. 

Once Rim4 aggregates into amyloid-like structures and binds 
to target RNAs, it inhibits translation until the completion of 
meiosis I (Figure 7F). At the onset of meiosis II, Rim4 aggregates 
are degraded by an unknown mechanism allowing translation to 
commence. Our studies suggest that the 0-terminal LO se- 
quences are critical for this degradation. This finding raises the 
interesting possibility that Rim4 must be in the amyloid-like 
form to be cleared from cells. This would imply that autophagy 
degrades Rim4 structures at the end of meiosis I. However, we 
have not detected Rim4-EGFP in the vacuole at the time of 
its degradation at the onset of meiosis II, arguing against auto- 
phagy-mediated clearance of Rim4. Rim4 could be eliminated 



(D and E) DAZL forms SDS-resistant aggregates in mouse gametogenesis. Brain, intestine, iiver, and testes were harvested from a 2-month-oid C57BL/6J maie 
mouse. Sampies were Dounce homogenized in SDD-AGE buffer and anaiyzed by SDD-AGE (D) and SDS-PAGE (E) DAZL was detected by immunobiot anaiysis. 
(F) Modei for Rim4-mediated transiationai controi. See text for detaiis. 

See aiso Table S2. 
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by proteasomal degradation, but such a mechanism would likely 
require prior aggregate disassembly. Our previous studies have 
implicated Ime2-mediated phosphorylation in triggering Rim4 
degradation (Berchowitz et al., 2013). Perhaps Ime2 phosphory- 
lation triggers Rim4 disassembly and degradation by the protea- 
some. Amyloids and amyloid-like aggregates are usually stable 
and long-lived. Understanding how they are cleared from cells 
at the appropriate developmental transition leaving Rim4 target 
RNAs intact ready to be translated will be critically important. 

Mechanisms of Translational Repression by Rim4 

Why must Rim4 assemble into amyloid-like particles to repress 
translation? An aggregate-based apparatus rather than a mech- 
anism involving monomer binding could be used to localize the 
translationally inhibited mRNA to certain sites within the cell 
where the protein products are eventually needed. This is un- 
likely to be the case for Rim4. The protein does not show any 
localization preference within the cytoplasm (Berchowitz et al., 
2013). The amyloid-like state could allow for heterotypic interac- 
tions with other proteins with amyloid-like properties that are 
involved in translational repression. Amyloids have the ability 
to influence cross-seeding of other amyloidogenic proteins (Der- 
katch and Liebman, 2007). A computational survey in budding 
yeast identified 179 proteins harboring prion domains many of 
which exhibit canonical properties of amyloid-forming proteins 
(Alberti et al., 2009). Many of these proteins are involved in 
RNA processing and translational control. Cross-seeding be- 
tween proteins with amyloid-like properties could offer a mech- 
anism for the assembly of translationally repressive cytoplasmic 
structures similar to P-bodies (Decker et al., 2007). 

Translational Control Mediated by Amyloid-like 
Aggregates Could Be a Universal Property of 
Gametogenesis 

A significant proportion of RRM-containing proteins also contain 
a prion domain and/or LC sequences. This raises the interesting 
possibility that regulation of RNA functions by amyloid-like 
aggregates is a frequently used mechanism of controlling 
gene expression (Kato et al., 2012; King et al., 2012). In budding 
yeast, ~1 % of genes encode proteins bearing RRM domains 
(52 genes) and ~3% of genes encode proteins containing a prion 
domain (179 genes) (Alberti et al., 2009). Thirteen genes encode 
proteins with both an RRM and prion domain. This is ~1 0-fold 
higher than is expected to occur at random. A similar situation 
exists in humans. The coexistence of RRM and prion domains 
within the same protein is 13-fold higher than what would be 
expected to occur by chance (King et al., 201 2). This association 
has led to the proposal that multimerization via prion domains or 
LC sequences is necessary for the function of many RNA-bind- 
ing proteins. Indeed, several RNA-binding proteins containing 
a prion domain have well-characterized roles in translational 
control such as Ccr4, Pat1 , TIF4632 (elF4G), and Whi3. Under- 
standing under which circumstances an amyloid-like structure 
can promote or repress translation will provide critical insights 
into the pathogenesis of amyloids. 

Remarkably, both yeast and mouse contain an RNA-binding 
protein, Rim4 and DAZL, respectively, that functions in transla- 
tion and forms SDS-resistant aggregates during gametogenesis. 



This finding raises the possibility that DAZL forms amyloid-like 
aggregates to regulate translation. Additionally, it leads to the 
speculation that gametogenic processes as diverse as spore 
formation in yeast and sperm production in mammals are funda- 
mentally related and rely on an ancient process that involves the 
formation of large amyloid-like aggregates. 

EXPERIMENTAL PROCEDURES 

Sporulation Conditions 

Strains were grown to saturation in YPD, diiuted in BYTA (1 % yeast extract, 
2% tryptone, 1% potassium acetate, 50 mM potassium phthaiate) to 
ODeoo = 0.25 and grown overnight. Ceiis were resuspended in sporuiation me- 
dium (0.3% potassium acetate [pH 7.0], 0.02% raffinose) to ODeoo = 1-8 and 
sporuiated with vigorous shaking at 30°C. pGAL-NDT80, GAL4.ER strains 
were reieased from the arrest by the addition of 1 laM (3-estradioi at 6 hr. In 
ndt80/^ strains, expression of pGAL-5' UTRclbsCLBS-SHA was induced with 
1 i^M (3-estradiol at the indicated times. 

Transmission Eiectron Microscopy 

For negative staining, 10 |il of protein solution was applied to a TEM grid 
(Electron Microscopy Sciences), washed with distilled water, and stained 
with several drops of 1 % or 2% (w/v) aqueous uranyl acetate. Excess stain 
was removed and the grids air-dried. TEM images were obtained at 80 kV 
on an FEI Technai Spirit TEM. For cryo-TEM, 10 |al of protein solution was 
applied to a TEM grid, washed with distilled water, and plunged into cooled 
liquid ethane. TEM images were obtained at 1 20 kV on a JEOL 21 00 FEG TEM. 

Semi-denaturing Detergent Agarose Gei Eiectrophoresis 

Samples (2 ml) of sporulating culture were harvested by centrifugation and 
snap frozen in liquid nitrogen. Cell pellets were resuspended in lysis buffer 
(100 mM Tris/HCI pH 8.0, 20 mM NaCI, 2 mM MgCl 2 , 50 mM |3-mercaptoetha- 
nol, 1 % Triton-X, 2x Halt protease inhibitors (Thermo]) and broken using a Fast 
Prep (MP bio) and zirconia beads (BioSpec). The lysates were clarified twice by 
centrifugation at 2,500 relative centrifugal force (ref) for 5 min (4°C). Loading 
buffer (final concentration 0.5x TAE, 5% glycerol, 2% SDS, bromophenol 
blue) was added to lysates that were then incubated for 10 min at room tem- 
perature. Following incubation, samples were separated on a 1 .7% agarose 
gel for 16 hr at 29 V at 4°C using 1 x TAE supplemented with 0.1% SDS as 
the running buffer. The gel was blotted to nitrocellulose by downward capillary 
transfer in 1 x TBS and processed by the immunoblot procedure outlined in the 
Supplemental Experimental Procedures. 

SUPPLEMENTAL INFORMATION 

Supplemental Information includes Supplemental Experimental Procedures, 
six figures, two tables, and six movies and can be found with this article online 
at http://dx.d 0 i. 0 rg/l 0. 1 01 6/j.cell.201 5.08.060. 
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SUMMARY 

Regulated protein degradation is essential. The 
timed destruction of crucial proteins by the CIpXP 
protease drives cell-cycle progression in the bacte- 
rium Caulobacter crescentus. Although CIpXP is 
active alone, additional factors are inexplicably 
required for cell-cycle-dependent proteolysis. Here, 
we show that these factors constitute an adaptor 
hierarchy wherein different substrates are destroyed 
based on the degree of adaptor assembly. The hier- 
archy builds upon priming of CIpXP by the adaptor 
CpdR, which promotes degradation of one class of 
substrates and also recruits the adaptor RcdA to 
degrade a second class of substrates. Adding the 
PopA adaptor promotes destruction of a third class 
of substrates and inhibits degradation of the second 
class. We dissect RcdA to generate bespoke adap- 
tors, identifying critical substrate elements needed 
for RcdA recognition and uncovering additional 
cell-cycle-dependent CIpXP substrates. Our work 
reveals how hierarchical adaptors and primed prote- 
ases orchestrate regulated proteolysis during bacte- 
rial cell-cycle progression. 

INTRODUCTION 

Regulated proteolysis is crucial for all life. For example, the 
timely destruction of key regulators by energy-dependent pro- 
teases during the eukaryotic and bacterial cell cycle drives 
replication and growth (King et al., 1996; Konovalova et al., 
2014). Because proteolysis is irreversible, cells face the sub- 
stantial challenge of stringently distinguishing specific proteins 
that are to be rapidly destroyed from many others meant to 
remain stable. In bacteria, energy-dependent proteases, such 
as the essential CIpXP protease, use accessory factors called 
adaptors to modulate substrate specificity (Gottesman, 2003; 
Ades, 2004; Guo and Gross, 2014). Adaptor proteins often 
work by binding directly to substrates and targeting them to 
appropriate proteases. One of the first characterized adaptors 
is SspB, which enhances CIpXP degradation of incomplete 
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translation products tagged by the ssrA peptide by binding to 
the peptide tag and tethering the substrate to the protease, 
catalyzing immediate destruction (Dougan et al., 2003; Lev- 
chenko et al., 2003). In this case, the adaptor enforces rapid 
degradation by increasing effective substrate concentration 
through tethering (Kirstein et al., 2009; Battesti and Gottesman, 
2013a). Adaptor proteins can themselves be regulated by anti- 
adaptors, proteins that block adaptor activity during different 
growth or environmental conditions (Battesti and Gottesman, 
2013a). 

The Gram-negative a-proteobacterium Caulobacter crescentus 
has a dimorphic life cycle wherein an obligate transition from 
a swarmer cell stage to a stalked cell stage is driven by degra- 
dation of key regulatory proteins (Poindexter, 1981; Curtis and 
Brun, 2010). The essential transcription factor CtrA inhibits 
replication initiation and is degraded by CIpXP during the 
swarmer to stalked cell transition (Quon et al., 1996; Domian 
et al., 1997; Jenal and Fuchs, 1998). The destruction of CtrA 
promotes assembly of replication initiation machinery at the 
origin in stalked cells and drives the progression of the cell cy- 
cle (reviewed in Marczynski and Shapiro, 2002; Thanbichler, 
2010). Similarly, the developmental regulator TacA, which acti- 
vates transcription of genes involved in stalk biogenesis and 
polar development (Biondi et al., 2006; Radhakrishnan et al., 
2008), is degraded by CIpXP in a cell-cycle-dependent manner 
(Bhat et al., 2013). 

Degradation of both TacA and CtrA depends on the response 
regulator CpdR (Iniesta et al., 2006; Bhat et al., 2013). Previous 
reconstitution experiments have shown that CpdR enhances 
degradation of some CIpXP substrates in a phosphorylation- 
dependent manner by acting as an adaptor to activate substrate 
recognition (Abel et al., 201 1 ; Rood et al., 201 2; Lau et al., 201 5). 
However, addition of the CpdR adaptor alone does not improve 
degradation of CtrA in vitro, but addition of two additional 
factors, RcdA and PopA, promotes degradation of CtrA by 
CIpXP in a second messenger and phosphorylation-dependent 
manner (Smith et al., 2014). RcdA and PopA were originally 
identified as necessary for cell-cycle-dependent CtrA degrada- 
tion in vivo (McGrath et al., 2006; Duerig et al., 2009) and are 
thought to obligately interact with each other (Duerig et al., 
2009; Ozaki et al., 2014). Interestingly, orthologs of CpdR and 
RcdA are found in most a-proteobacteria, but PopA is restricted 
to Caulobacter and closely related species (Brilli et al., 2010; 
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Ozaki et al., 2014), suggesting that CpdR and RcdA may work 
independently of PopA. How these factors mechanistically 
function to promote degradation of substrates by CIpXP has 
remained a mystery. 

Here, we show that a hierarchy of adaptors promotes selec- 
tive protein degradation dependent on the degree of adaptor 
assembly with the key finding that RcdA is an adaptor that 
requires a primed protease. First, we identify a PopA-indepen- 
dent function for the RcdA adaptor in facilitating cell-cycle- 
dependent degradation of the developmental regulator TacA 
by the CIpXP protease. We reconstitute TacA degradation 
in vitro to show that RcdA and CpdR are necessary and suffi- 
cient to deliver the TacA substrate to CIpXP. Systematic 
dissection of TacA uncovers a minimal element that directs 
RcdA binding and a sequence motif needed for protease 
recognition. We find that RcdA contains a substrate-binding 
domain and a tethering motif that leashes the adaptor to CIpXP 
only when the protease is first primed by the CpdR adaptor. 
We find that these domains are modular and design chimeric 
adaptors with altered target specificity and protease selectivity. 
We identify additional RcdA-dependent CIpXP substrates, 
demonstrating that the RcdA adaptor can bind and deliver a 
range of substrates. Finally, we show how PopA works as 
both an adaptor and anti-adaptor of RcdA by inhibiting degra- 
dation of some substrates and promoting degradation of 
others, dependent on ligand binding. This work establishes a 
class of protease adaptors that tether substrates and additional 
adaptors selectively to a primed protease, revealing how an 
adaptor hierarchy enforces the selective protein destruction 
that drives the cell cycle. 



Degradation of the TacA Substrate 
Requires CpdR and RcdA but Not 
PopA 

We previously identified TacA as a CIpXP 
substrate that requires CpdR for its cell- 
cycle-dependent degradation (Bhat et al., 2013) (Figure IB). 
During our characterization of this result, we found that expres- 
sion of a nondegradable TacA allele (TacA-DD, Bhat et al., 201 3) 
resulted in cells with longer stalks compared to wild-type 
(WT) cells (Figure 1 A), consistent with the role of TacA in stalk 
biogenesis. Cells lacking RcdA were previously reported to 
have increased stalk length (McGrath et al., 2006), and quantita- 
tive measurements showed that these increased lengths were 
statistically indistinguishable from those in the TacA-DD strain 
(Figure 1A), suggesting that RcdA might be needed for proper 
TacA degradation. In accordance with this model, the cell-cy- 
cle-dependent degradation of TacA is lost in ArcdA strains 
(Figure IB). 

Prior work suggested that RcdA requires the PopA protein for 
its biological functions, such as the cell-cycle-dependent degra- 
dation of CtrA (Duerig et al., 2009) (Figure 1 B). Indeed, the regu- 
lated degradation of CtrA has been recently reconstituted 
in vitro, and PopA was necessary to forming a complex with 
both RcdA and CtrA as part of this process (Smith et al., 2014). 
Furthermore, all CIpXP substrates that rely on RcdA for cell-cy- 
cle-dependent degradation also require PopA (Duerig et al., 
2009; Radhakrishnan et al., 2010). Thus, we were surprised to 
find that although TacA degradation in vivo was dependent on 
RcdA, degradation was unaffected in a ApopA strain (Figure 1 B). 
These results show an unexpected role for RcdA separate from 
PopA in driving cell-cycle-dependent CIpXP proteolysis, a 
feature that we next explored biochemically. 

Our original identification of TacA as a CIpXP substrate 
showed that it was slowly degraded by CIpXP in vitro and 
required CpdR for in vivo degradation (Bhat et al., 2013). 
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Because the CpdR adaptor delivers substrates such as PdeA to 
the CIpXP protease (Abel et. al., 2011; Rood et al., 2012; Lau 
et al., 2015), we first tested whether CpdR could stimulate 
TacA degradation by CIpXP as well. However, TacA degradation 
was unchanged in the presence of CpdR alone (Figure 2A). We 
next asked whether addition of both RcdA and CpdR affected 
TacA degradation in vitro. Consistent with our in vivo observa- 
tions (Figure IB), TacA degradation by CIpXP was dramatically 
accelerated (>20-fold decrease in half-life) in the presence of 
both CpdR and RcdA (Figures 2A and SIB). This stimulation 
was specific as degradation of an unrelated substrate (GFP- 
ssrA) by CIpXP was unaffected by the addition of RcdA and 
CpdR (Figure SIC). RcdA and CpdR failed to deliver TacA with 
mutations on the extreme C-terminal residues to di-aspartate, 
suggesting that the native C terminus constitutes the protease 
recognition motif (Figure 2A) (Bhat et al., 2013). Taken together, 
these results confirm our in vivo findings that RcdA has a PopA- 
independent role in promoting degradation of certain CpdR- 
dependent CIpXP substrates. 

The RcdA Adaptor Binds the TacA Substrate Directly to 
Promote Degradation 

If CpdR and RcdA together make up a substrate-specific 
adaptor system for delivery of TacA to CIpXP, it seems reason- 
able that one of the two should interact directly with TacA. Based 
on size-exclusion chromatography, we found that the RcdA 
adaptor could directly bind to TacA (Figure 2B). TacA is an 
NtrC-family response regulator that contains an N-terminal 
receiver domain (RD), an ATP-binding AAA^ domain (AAA), and 
a C-terminal DNA-binding domain (DBD) (Figure 2C). To deter- 
mine which region of TacA is needed for RcdA-dependent 
recognition and delivery, we generated a series of TacA trunca- 
tions. Given that the degradation of the response regulator CtrA 
by CpdR/RcdA/PopA relies on its N-terminal receiver domain 
(Ryan et al., 2002), we expected a similar reliance for TacA. Sur- 
prisingly, the RD region was dispensable for adaptor-dependent 
proteolysis as fragments containing only the DBD bind to RcdA 
(Figure SID) and are degraded similarly to full-length protein 
(Figure 2C). Further truncations and fusions revealed that a re- 
gion of 12 residues (437-448) within TacA was necessary for 
RcdA binding (Figures 2C and 2D). In total, these results support 
a model wherein the C-terminal domain of TacA contains both 
RcdA and CIpXP recognition motifs that ensure robust degrada- 
tion of TacA in a CpdR-dependent fashion (Figure 2E). We next 
sought to understand how the RcdA adaptor delivers the TacA 
substrate to the CIpXP protease and how this delivery depends 
on the adaptor CpdR. 

The RcdA Adaptor Contains a Tethering Motif Needed 
for Substrate Delivery to the CpdR-Primed CIpXP 
Protease 

Previous work had shown that the disordered C-terminal tail of 
the RcdA adaptor plays a critical role in the regulated degrada- 
tion of the CtrA substrate in vivo (Taylor et al., 2009). To deter- 
mine whether the C terminus of RcdA is also important for 
TacA degradation, we expressed a variant of RcdA lacking 19 
residues from the C terminus (RcdAAC) as the sole copy in vivo 
and monitored levels of TacA and CtrA during synchronous 



growth. As expected, CtrA levels were stable throughout the 
cell cycle (Taylor et al., 2009) (Figure 3A). TacA levels also remain 
stable in this background, whereas the non-RcdA-dependent 
CIpXP substrate McpA was still degraded in a cell-cycle-depen- 
dent manner (Figure 3A). Consistent with these in vivo results, 
RcdAAC did not stimulate degradation of TacA in vitro even in 
the presence of CpdR (Figure 3B). Interestingly, RcdAAC is still 
capable of binding TacA (Figures 3C, S2A, and S2B), suggesting 
that the defect in TacA degradation is downstream of substrate 
binding. This architecture is reminiscent of the SspB adaptor, in 
which an N-terminal domain binds ssrA-tagged substrates and 
a CIpX-binding motif at the C terminus anchors the adaptor to 
the protease (Dougan et al., 2003; Levchenko et al., 2003). 
Therefore, we next sought to characterize how the C terminus 
of the RcdA adaptor contributes to substrate delivery. 

The C terminus of RcdA harbors a number of hydrophobic res- 
idues with spacing consistent with an amphipathic helix (Figures 
3D and S2C). Mutation of these residues inhibited RcdA-depen- 
dent delivery of the TacA substrate, consistent with a model in 
which the hydrophobic face of the putative amphipathic helix is 
needed for delivery. This result suggested that the RcdA tail 
acts as a binding element for CpdR, CIpX, or both as part of 
the mechanism by which the RcdA adaptor delivers substrates 
to CIpXP. Recently, we showed that CpdR acts a priming 
adaptor, binding to the N-terminal domain of CIpX to enhance 
recruitment of substrates such as PdeA and McpA (Lau et al., 
2015). We considered that the C-terminal tail of the RcdA 
adaptor might also be selectively binding to a CpdR-primed 
CIpXP protease, promoting substrate delivery. 

We first tested this model by fusing the C-terminal 1 9 residues 
of RcdA to SspB lacking its CIpX-binding motif to generate a 
SspB~RcdA chimera (Figure 3E). Our rationale was that this 
chimeric construct would bind ssrA-tagged substrates similar 
to SspB but with the protease-binding specificity of RcdA. We 
used an SspB-obligate substrate (GFP-ssrA-SS) that requires 
an adaptor for CIpXP degradation (Lau et al., 2015) (Figure 3F). 
Consistent with our model that the RcdA C-terminal tether 
relies on a CpdR-primed CIpXP protease, the SspB~RcdA 
chimeric adaptor fails to deliver GFP-ssrA-SS to CIpXP alone 
but robustly promotes substrate degradation when CpdR is 
added (Figure 3F). 

Next, we directly measured RcdA binding to its protease 
partner by following anisotropy of a fluorescently labeled pep- 
tide consisting of the last 19 residues of RcdA. This peptide 
bound poorly to either CpdR or CIpX alone, but addition of 
both resulted in strong peptide binding (Figure 3G). Excess 
RcdA competitively inhibited peptide binding, but RcdAAC 
could not compete, consistent with the above results illus- 
trating that the C terminus of RcdA is the major contact with 
the CpdR-primed CIpX (Figures 3G and S2D). CpdR binds 
the unique N-terminal domain of CIpX as part of its delivery 
mechanism (Lau et al., 2015), and we found that RcdA peptide 
binding and substrate delivery to the CpdR-CIpX complex also 
require this region (Figures 3G and S2E). Taken together, these 
results support a model in which the CpdR adaptor binds to 
the N-terminal domain of CIpX, priming it to recognize the 
C-terminal disordered region of RcdA and enhance degrada- 
tion of the TacA substrate. 
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Figure 2. RcdA Binds the TacA Substrate Directly to Promote Degradation by CIpXP 

(A) RcdA and CpdR collaborate to accelerate TacA degradation in vitro. Degradation reactions were performed using either RcdA or CpdR alone or together. 
Reaction containing both RcdA and CpdR but not CIpX served as a control. Reactions consisted of 1 ^iM TacA, 1 ^iM TacA-DD, 1 ^iM RcdA, 2 ^iM CpdR, 0.4 ^iM 
CIpXe, and 0.8 |iM ClpPi 4 when indicated. See also Figures SI A and SIB. 

(B) RcdA directly binds to TacA. Hise-TacA or Hise-RcdA were loaded either alone or together onto the analytical size-exclusion column. Collected fractions were 
resolved by SDS-PAGE gels. Protein standards used to calibrate the column are indicated above. 

(C) C-terminal region of TacA contains the necessary recognition element for RcdA/CpdR-mediated degradation. Both N- and C-terminally truncated variants of 
TacA as indicated by residue numbers were generated and subjected to RcdA/CpdR-mediated CIpXP degradation. Degradation reaction consisted of 8 |xM TacA 
(2-116), 1 |4M TacA (117-488), 3 i^M TacA (312-488), 2 laM hiSeSUMO-TacA (437-488), 2 laM hiSeSUMO-TacA (449-488), 1 laM RcdA, 2 i^M CpdR, 0.2 ^lM CIpXe, 
and 0.4 laM ClpPi 4 when indicated. Arrows indicate bands corresponding to the TacA variants. 

(D) RcdA binds the C terminus domain of TacA containing residues 437-488 but not a domain lacking residues 437-448. RcdA- or SUMO-appended TacA 
(437-488) or TacA (449-488) variants were loaded either alone or together onto the analytical size-exclusion column. Collected fractions were resolved by 
SDS-PAGE gels and stained by Coomassie. Protein standards used to calibrate the column are indicated above. See also Figure S1 D. 

(E) Cartoon of working model wherein RcdA binds TacA and delivers it to a CpdR-bound CIpXP. 
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Tethering of the RcdA Adaptor to CIpX Bypasses the 
Need for CpdR 

Our results suggested that tethering the RcdA adaptor to a 
CpdR-primed CIpXP complex is needed for TacA degradation. 
If this model is correct, then we should be able to constitutively 
activate RcdA as an adaptor by directly tethering it to CIpX. 
We fused the CIpX-binding motif of SspB to the RcdAAC variant 
to generate the RcdAAC~XB chimera. In accordance with our 
model, RcdAAC~XB was able to stimulate the degradation of 
TacA without the need for CpdR (Figure 4A). Strains lacking 
CpdR have higher steady-state levels of TacA than WT strains, 
as there is a loss of CpdR-dependent TacA degradation (Figures 
4B and 4C). Importantly, expression of RcdAAC~XB in this 
background reduces TacA to WT levels (Figure 4B) due to a re- 
covery of TacA degradation in a CpdR-independent manner 
(Figure 4C). 

RcdA is known to interact with PopA, a cyclic di-GMP (cdG) 
binding protein (Duerig et al., 2009). We recently showed that 
RcdA, PopA, and CpdR form a multi-protein adaptor complex 
in the presence of cdG that delivers CtrA for degradation by 
CIpXP (Duerig et al., 2009; Smith et al., 2014). The ability of 
RcdA to aid in this adaptor complex formation requires CpdR 
(Smith et al., 2014), and we speculated that this extended func- 
tion of RcdA could also be bypassed by RcdAAC~XB (Fig- 
ure 4D). Consistent with this hypothesis, adding RcdAAC~XB, 
PopA, and cdG stimulated the degradation of CtrA even without 
CpdR (Figures 4E and S3). Similar to the recovery of TacA degra- 
dation shown above, degradation of CtrA in vivo could also be 
restored in cells lacking CpdR if RcdAAC~XB was expressed 
(Figure 4F). These results demonstrate that direct tethering of 
RcdA to CIpX is sufficient for delivery of substrates to CIpXP 
and to assemble a multi-protein adaptor complex for CtrA degra- 
dation. Thus, priming of CIpX by the CpdR adaptor contributes to 
CtrA/TacA degradation principally by generating a tethering site 
for the RcdA adaptor. 

Proteomic Identification of Additional RcdA-Dependent 
CIpXP Substrates 

Cur working model is that the RcdA adaptor recognizes protease 
substrates via an N-terminal substrate-binding domain and en- 
gages a CpdR-primed CIpX via its disordered C-terminal region. 
We speculated that other substrates in addition to TacA may also 
be delivered to CIpXP in a similar fashion and used a proteomics 
approach to identify these candidates. We expressed epitope- 
tagged RcdA variants M2-RcdA (which fully complements; Fig- 
ure S4A) and M2-RcdAAC in Arcc/A cells, lysed cells, and precip- 
itated RcdA-interacting proteins using M2-FLAG affinity beads 
(Figure 5A). Cur rationale for using both variants was that targets 
of RcdA should bind both constructs but would be enriched 
in the M2-RcdAAC binding pool due to the cell’s inability to 
degrade those targets. As expected, TacA was enriched in the 
elution fraction of M2-RcdAAC relative to the M2-RcdA elution 
and was absent in mock pull-down experiments (Figure S4B). 
We identified putative RcdA partners by trypsinization/mass 
spectrometry and applied an enrichment filter to prune the 
candidate pool (see Experimental Procedures and Supplemental 
Experimental Procedures). We focused on two proteins of un- 
known function that were strongly enriched by this approach 



(see Experimental Procedures and Supplemental Experimental 
Procedures). CC2323 was a protein that we had previously iden- 
tified as a CIpXP substrate based on a CIpP trapping approach 
(Bhat et al., 2013), and CC3144 was a protein that was enriched 
as strongly as TacA in our RcdA pull down (see Experimental 
Procedures and Supplemental Experimental Procedures). 

We found that CC3144 protein levels oscillated in a cell-cycle- 
dependent manner similar to TacA and CtrA (Figure 5B). Impor- 
tantly, purified CC3144 was only robustly degraded upon 
addition of CpdR and RcdA (Figure 5C). CpdR and RcdA were 
both required for CC3144 degradation in vivo, but as in the 
case for TacA, PopA was dispensable for CC3144 degradation 
(Figure 5B). Similarly, RcdA forms a tight complex with purified 
CC2323 in vitro (Figure S4C), and degradation of this protein 
by CIpXP was enhanced by addition of CpdR and RcdA (Fig- 
ure 5D), although an M2-tagged variant of CC2323 did not mirror 
TacA or CC3144 degradation in vivo, possibly due to disruption 
of the protein by the tag (Figure S4D). T aken together, these data 
reveal that the RcdA adaptor can bind and deliver a number of 
substrates in addition to TacA (Figure 5E). 

Adaptors Can Also Act as Anti-adaptors 

Adaptors can be controlled by anti-adaptors, as shown by the 
complex regulation of the RssB adaptor by the Ira family of pro- 
teins (Bougdour et al., 2006; Battesti et al., 2013b). Cutside of 
RssB, few examples of anti-adaptors have been described; 
however, we considered that RcdA could be subject to anti- 
adaptor regulation as it can clearly bind diverse partners 
(Figure 5). In particular, PopA must be bound to cdG in order 
to facilitate degradation of CtrA by CpdR/RcdA/CIpXP (Fig- 
ure S3) (Smith et al., 2014), but prior work suggests that PopA 
and RcdA still bind in the absence of cdG (Duerig et al., 2009; 
Smith et al., 2014). Therefore, we speculated that PopA could 
act as an inhibitory anti-adaptor for RcdA-specific substrates 
in addition to its stimulatory role in activating CtrA degradation 
(Figure 6A). Indeed, steady-state levels of TacA and CC3144 
are slightly lower in ApopA cells (Figures 6B and S5A). Consis- 
tent with an anti-adaptor role for PopA, TacA is degraded more 
rapidly in ApopA cells (Figures 6C, S5B, and S5C). Excess 
PopA suppresses RcdA-dependent TacA degradation in vitro 
even in the absence of cdG (Figure 6D), whereas addition of 
cdG promotes CtrA degradation (Figure 6E). Thus, PopA can 
be both an adaptor of RcdA that enhances CtrA degradation 
and an anti-adaptor of RcdA that blocks TacA degradation. Simi- 
larly, excess RcdA inhibits degradation of the PdeA substrate 
in vitro, and steady-state levels of the McpA substrate are lower 
in the absence of RcdA and PopA (Figures S5D and S5E), consis- 
tent with a general model wherein members of the adaptor 
hierarchy can act as anti-adaptors for substrates at a lower 
hierarchical level. 

DISCUSSION 

Protein degradation is essential but must be exquisitely 
controlled so that off-target proteins are not destroyed, as unre- 
strained proteolysis is lethal to cells (Brotz-Cesterhelt et al., 
2005). In Caulobacter crescentus, the regulated destruction of 
cell-cycle factors by the essential protease CIpXP coordinates 
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Figure 3. RcdA Contains a Tethering Motif Needed for Substrate Delivery to the CpdR-Primed CIpXP 

(A) Cell-eyele-dependent degradation of TaoA requires the C-terminal region of RedA. Strain expressing RedAAC (RedA laeking the 1 9-residue C-terminal region) 
was synehronized, and isolated swarmer eells were released into fresh PYE medium. Constant volume of samples was withdrawn at indieated time points for 
western blot analysis. Eaeh sample was probed with anti-TaoA, anti-CtrA, anti-MepA and anti-CIpP antibodies. 

(B) TaoA degradation is not stimulated by RedAAC in vitro even in the presenee of CpdR. Degradation reaetions were performed with either purified full-length 
RedA or RedAAC in the presenee of CpdR. Degradation reaetions were similar to those in Figure 1 A exeept that 1 laM RedAAC was used. 

(C) RedAAC direetly binds the C-terminal domain of TaoA. HiSe-RedAAC was used as a bait protein to pull down TaeA (312-488) by using Ni-NTA affinity resin. 
See also Figures S2A and S2B. 

(D) Hydrophobie residues in the C-terminal region of RedA are important for RedA-mediated delivery of TaeA. Residues LI 60, LI 63, and FI 67 were mutated to 
aspartate, and residues D161 and R162 were mutated to alanines (arrows). RedAD161A,R162A was used as a eontrol. See also Figure S2C. TaeA degradation 
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replication and development. Genetic studies pointed to several 
key factors necessary to mediate degradation in vivo, but it was 
unclear how they mechanistically coordinated regulated degra- 
dation. Part of this uncertainty arose from the complex relation- 
ship between these factors and substrate degradation, wherein 
different factors were needed for degradation of different sub- 
strates. Although CIpXP is active on its own, the response 
regulator CpdR is required for degradation of all known cell- 
cycle-dependent CIpXP substrates in vivo (Iniesta et al., 2006; 
Abel et al., 201 1 ; Bhat et al., 2013), and the proteins RcdA and 
PopA are needed for degradation of only a subset of these tar- 
gets (Duerig et al., 2009; Radhakrishnan et al., 2010). We found 
previously that the adaptor CpdR directly facilitates degradation 
of some CIpXP substrates, such as the phosphodiesterase PdeA 
(Rood et al., 2012; Lau et al., 2015); however, the CpdR adaptor 
alone is insufficient to enhance degradation of other substrates 
such as CtrA (Smith et al., 2014). Here, we show that RcdA is 
an adaptor that tethers proteins to the CIpXP protease only 
when the protease is first primed by CpdR. The RcdA adaptor 
binds the protease substrates TacA, CC2323, and CC3144 to 
deliver them to a CpdR-primed CIpXP. RcdA also binds the 
PopA adaptor, which promotes recruitment of additional sub- 
strates, such as CtrA, to the primed protease (Figure 7). 

Implications for Adaptor and Substrate Discovery 

Traditionally, an adaptor protein could be recognized by its abil- 
ity to bind substrate directly and to physically interact with the 
protease to deliver the substrate. Examples of CIpXP adaptors 
include SspB, which binds and delivers ssrA-tagged substrates, 
and RssB, which delivers the RpoS substrate to CIpXP (Zhou 
et al., 2001; Dougan et al., 2003). For both cases, the definitive 
proof that these were adaptors was that they were able to 
enhance degradation of their substrates by CIpXP in vitro. It is 
inherently difficult to identify new adaptors because without 
prior knowledge of their substrates, designing an experiment 
to test adaptor activity is almost impossible. For example, in 
this work we found that RcdA was able to deliver substrates 
for CIpXP-mediated degradation. However, RcdA cannot 
deliver substrates to CIpXP alone but requires a CpdR-primed 
CIpXP. Had we not initially identified CpdR as an adaptor of 
CIpXP during our characterization of PdeA degradation (Abel 
et al., 2011; Lau et al., 2015), our efforts to reconstitute the 
adaptor activity of RcdA would have been futile. This example 
demonstrates a central difficulty in adaptor/substrate discovery. 
Namely, a candidate is proven to be an adaptor by monitoring 
the degradation of a specific substrate, but reconstituting 
substrate degradation requires knowing that the adaptor or 
other additional factors are needed in the first place. Systematic 



approaches to identifying new adaptor/substrate pairs are 
needed. 

Proteolytic Regulation by Hierarchical Assembly of 
Adaptors 

Protein degradation is a tightly regulated process that is needed 
for cell-cycle progression. In eukaryotes, the 26S proteasome 
specifically recognizes ubiquitinated proteins generated by the 
stage-specific activity of APC/C and SCF ubiquitin ligases. As 
bacteria lack ubiquitination, they often employ adaptors to pro- 
mote rapid protein degradation. How this process can accommo- 
date the hundreds of substrates degraded during the cell cycle 
(Duerig et al., 2009; Radhakrishnan et al., 201 0; Bhat et al., 201 3; 
Smith et al., 2014) while maintaining specificity is a substantial 
challenge. One way bacteria can overcome this challenge is to 
employ hierarchical control of proteases, tightly regulating the 
activity of a single protein during the cell cycle that initiates 
a cascade of proteolytic events. In the case of Caulobacter 
crescentus, cell-cycle-dependent phosphorylation of CpdR 
gates its ability to prime CIpX for recruitment of substrates such 
as PdeA (Abel et al., 2011; Lau et al., 2015). When activated, 
CpdR binding to CIpX also promotes engagement of the RcdA 
adaptor that directly delivers some substrates (TacA, CC2323, 
CC3144) and indirectly delivers substrates such as CtrA through 
recruitment of additional adaptors such as PopA. Thus, which 
substrate is degraded depends on the degree of the adaptor 
hierarchy that has assembled at that time, which could aid in pri- 
oritization of substrate destruction, e.g., CtrA is only degraded 
when the complete adaptor hierarchy is assembled (Figure 7). 

When bound to cdG, PopA acts as an adaptor to promote 
CtrA degradation, but in the absence of cdG, PopA can moon- 
light as an anti-adaptor for RcdA-dependent protease sub- 
strates such as TacA. Similarly, excess RcdA adaptor can inhibit 
degradation of PdeA presumably by competing for a limited 
amount of CpdR-primed CIpXP. We speculate that a general 
feature of adaptor hierarchies is that adaptors operating at a 
higher level of the hierarchy can serve as anti-adaptors for 
substrates reliant only on the lower levels of the hierarchy. 
An intriguing corollary to this model is that other known anti- 
adaptors (such as the Ira family of proteins that block the RssB 
adaptor) could also moonlight as adaptors and aid in the delivery 
of as yet unknown substrates. 

The Role of Adaptor Hierarchies in Bacterial 
Development 

Finally, we again note that CpdR and RcdA are conserved 
throughout a-proteobacteria, whereas the presence of PopA 
appears restricted to Caulobacter and closely related species 



(E) Cartoon depicting SspB, RcdA, and SspB-^RcdA chimeric proteins and how deiivery of GFP-ssrA-SS substrate is affected by adaptors. Residue numbers 
indicate boundaries used to construct the chimeric protein. 

(F) SspB-^RcdA chimera deiivers GFP-ssrA-SS to CipXP in a CpdR-dependent manner. GFP-ssrA-SS was subjected to SspB--RcdA chimera-mediated 
degradation in the presence or absence of CpdR. Loss of GFP fiuorescence was monitored over time. Reaction consisted of 2 ^iM GFP-ssrA-SS, 0.5 ^iM SspB, 
0.5 i^M SspB (1-125), 1 i^M SspB-RcdA chimera, 4 i^M CpdR, 0.4 |iM CipXe, and 0.8 laM CipPi 4 when indicated. 

(G) The isoiated RcdA C-terminai peptide binds to CpdR-primed CipX but not CipX variant iacking the N-terminai domain. Fiuorescence anisotropy was 
measured for the binding of 40 nM FiTC-iabeied, C-terminai 19-residue peptide of RcdA to 4 laM CpdR, 1 laM CipX© or 1 laM ANCipXe, 10 i^M RcdA, and 10 i^M 
RcdAAC, either aione or in different combinations as indicated. 

Data represent mean ± SD. See aiso Figures S2D and S2E. 
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Figure 4. Direct Tethering of RcdA to CIpX Is Sufficient for Delivery of the TacA Substrate 

(A) RcdAAC^XB alone delivers TacA to CIpXP for degradation in vitro. Schematic depicting different fusion constructs is shown. Degradation reactions were 
performed with RcdAAC^XB alone or with RcdA alone or in the presence of CpdR. Degradation reactions consisted of 1 pM TacA, 1 pM RcdAAC-^XB, 1 pM 
RcdA, 2 pM CpdR, 0.4 pM CIpXe, and 0.8 pM ClpPi 4 when indicated. 

(B and C) RcdAAC-^XB delivers TacA for degradation in the absence of CpdR in vivo. Steady-state levels of TacA were measured in AcpdR cells expressing 
rcdAAC-^XB, rcdA or \NJ cells containing the empty vector. Exponential phase cells were induced with 0.003% xylose for 1 hr. Lysate from equal numbers of cells 
was used for western blot analysis to probe using anti-TacA antibody. Cropped image of the blot from AtacA lysate is shown. Protein synthesis was inhibited 
during exponential growth with chloramphenicol. Samples were withdrawn at indicated time points and normalized to CD before western blot analysis. Asterisk 
indicates a cross-reacting band. 

(D and E) RcdAAC-^XB in conjunction with PopA and cdG delivers a GFP-CtrA reporter to CIpXP independent of CpdR in vitro. Cartoon depicting RcdAAC-^XB/ 
PopA/cdG-mediated GFP-CtrA-RD+15 delivery to CIpXP is shown. Reactions for monitoring loss of fluorescence from GFP-CtrA-RD+15 degradation were 
carried out with RcdA alone, RcdA with PopA/cdG, RcdAAC-^XB alone, or RcdAAC-^XB with PopA/cdG. Degradation reactions consisted of 1 pM GFP-CtrA- 
RD+15, 1 pM RcdAAC--XB, 1 pM RcdA, 2 pM CpdR, 1 pM PopA, 20 pM cdG, 0.4 pM CIpXe, and 0.8 pM ClpPi 4 when indicated. See also Figure S3. 

(F) RcdAAC^XB promotes CtrA degradation in the absence of CpdR in vivo. Experiments were performed as in panel C, except that blots were probed with anti- 
CtrA and anti-CIpP antibodies. CIpP was used as a loading control. 

(Brilli et al., 201 0; Ozaki et al., 201 4). As PopA is responsive to the 
second messenger cyclic di-GMP, levels of which oscillate dur- 
ing the cell cycle (Abel et al., 2013; Lori et al., 2015), our work 
suggests that CpdR and RcdA may represent an ancestral 
adaptor complex that has been co-opted by Caulobacter for 
coupling second messenger cues to cell-cycle progression. In 
S. meliloti, it is clear that adaptors play a crucial role in the sym- 
biosis transition, wherein misregulation of CpdR dramatically 



affects proper nodule formation and plant growth (Kobayashi 
et al., 2009; Pin! et al., 2015; Schallies et al., 2015). In general, 
bacterial development (such as the morphological transition 
of Caulobacter or sporulation of Bacillus) requires changes 
in proteome composition within a single generation. Adaptor 
hierarchies could robustly promote these changes while main- 
taining the control needed to selectively degrade proteins in a 
specific order or priority. 
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Figure 5. Proteomic Identification and Validation of RcdA-Dependent CIpXP Substrates 

(A) In vivo pull downs reveal additional binding partners of RcdA. M2-FI_AG affinity resin was used to pull down M2-epitope tagged RcdA or RcdAAC from lysates 
of Arcc/A cells expressing either M2-RcdA or M2-RcdAAC. Lysate from cells expressing M2 peptide alone was used as a control. Cropped images of silver- 
stained SDS-PAGE gels from elution pool of M2, M2-RcdA, and M2-RcdAAC are shown. Protein markers are indicated. See also Figures S4A and S4B. 

(B) CC31 44 is degraded in a cell-cycle-dependent manner requiring RcdA and CpdR but not PopA. Cultures of WT or cells lacking RcdA, CpdR, and PopA were 
synchronized, and swarmer populations were released into fresh M2G medium. Constant volume of samples were withdrawn at indicated time points and probed 
with anti-CC3144 and anti-CtrA antibodies. 

(C and D) CIpXP-mediated degradation of CC31 44 or CC2323 is enhanced in the presence of RcdA and CpdR. Reactions consisted of 1 i^M CC31 44 or CC2323, 
1 |xM RcdA, 2 i^M CpdR, 0.4 ^iM CIpX©, and 0.8 ^iM ClpPi 4 when indicated. In (D), CC2323 degradation was visualized by Coomassie staining. In (C), overlapping 
bands made it necessary to use western blotting to detect the purified CC3144 using anti-CC3144 antibody. Asterisks denote cross-reacting bands. See also 
Figure S4C. 

(E) Cartoon of RcdA delivering diverse substrates to a CpdR-primed CIpXP. 
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Figure 6. Adaptors Can Also Act as Anti-adaptors 

(A) Cartoon depicting PopA-mediated inhibition of TacA degradation and PopA/edG-dependent stimuiation of CtrA degradation. 

(B) Steady-state ieveis of TacA and CC3144 are iower in ApopA ceiis compared to WT, ArcdA, or Aepdf? ceiis. Lysates from equai ODeoo of WT, ApopA, Arcc/A, 
and Acpc/f? strains were used for western biot anaiysis, probing with anti-TacA, anti-CC3144, anti-CtrA, and anti-MreB antibodies. MreB was used as a ioading 
controi. See aiso Figure S5A. 

(C) TacA is degraded more rapidiy in ApopA ceiis compared to WT. WT and ApopA ceiis bearing M2-epitope-tagged TacA were grown at exponentiai phase and 
subsequentiy induced with 0.003% xyiose for 1 hr. Protein synthesis was inhibited by addition of chioramphenicoi. Sampies were withdrawn at indicated time 
points and normaiized to CD before performing western biot anaiysis using anti-TacA and anti-CipP antibodies. CipP was used as a ioading controi. See aiso 
Figures S5B and S5C. 

(D) Excess PopA suppresses TacA degradation independent of cdG in vitro. Reactions consist of 1 |tM TacA, 1 laM RcdA, 2 laM CpdR, 5 |tM PopA, 100 |tM cdG, 
0.4 |tM CipXe, and 0.8 |tM CipPi 4 when indicated. 

(E) PopA/CpdR/RcdA acceierates CtrA degradation in a cdG-dependent manner in vitro. Reaction conditions are simiiar to those in Figure 6D, except that 1 laM 
PopA, 20 |tM cdG, and 10 |tM Pf^p were used. 



EXPERIMENTAL PROCEDURES (100 |ig/mi ampiciiiin, 50 irg/mi kanamycin, 50 irg/mi spectinomycin). 

Caulobacter strains were grown in Peptone-Yeast -Extract (PYE) media at 
Bacterial Strains and Growth Conditions 30°C with 25 irg/mi spectinomycin or 5 |ig/mi kanamycin, wherever required 

Bacteriai strains used in the study are tabuiated in Tabie S2. E. coli strains and suppiemented with xyiose to induce gene expression as indicated in the 

were grown in Luria-Broth (LB) at 37°C with the appropriate antibiotic figure iegends. 
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Figure 7. A Model for Adaptor Hierarchies Orchestrating Proteolysis 

Priming of the CipXP protease by the CpdR adaptor enhances degradation of the substrates McpA and PdeA. The CpdR-primed CipXP compiex can aiso recruit 
the RcdA adaptor. RcdA binds directiy to the substrates TacA, CC3144, or CC2323 and deiivers them to the CpdR-primed CipXP protease for degradation. The 
PopA protein acts as an anti-adaptor by binding the RcdA adaptor and inhibiting degradation of the substrate TacA. in response to cdG, PopA aiso acts as an 
adaptor of RcdA to deiiver the substrate CtrA to the CpdR-primed CipXP protease. This hierarchicai formation of adaptors, anti-adaptors, and primed proteases 
dictates ceii-cycie-dependent protein degradation. 



Cloning, Mutagenesis, and Protein Purification 

Caulobacter strains expressing TacA and TacA-DD were used as described 
previousiy (Bhat et al., 2013). Caulobacter strain harboring the rcdA/lC 
aiieie was used as described by Tayior et al. (2009). M2-tagged RcdA or 
M2-tagged RcdAAC was generated in pENTR plasmid and transferred into 
xylose-inducible expression plasmids using Gateway-based cloning (Skerker 
et al., 2005). Truncated variants of TacA were generated by amplifying the 
desired region of TacA, as indicated in the figures, using round-the-horn 
site-directed mutagenesis or Gibson assembly method (Gibson et al., 2009) 
with pET23SUMO as template. CC3144 and CC2323 were amplified with 
appropriate primers using genomic DNA from C. crescentus as template 
and then cloned into pTE28a and pET23SUMO expression plasmid, respec- 
tively. SspB-RcdA chimeric protein was generated by replacing 10 C-termi- 
nal residues of SspB with 19 C-terminal residues of RcdA. The RcdAAC--XB 
fusion construct was generated in pET28a vector by designing appropriate 
primers to append the last 10 residues of SspB to RcdAAC. All the 
constructs were confirmed by sequencing. Oligonucleotide sequences are 
available upon request. 

BL21(DE3) E.coli cells bearing expression plasmid for different proteins 
were grown till the optical density 600 (ODeoo) reached -^0.4 to 0.8, induced 
with 0.4 mM IPTG for 3-5 hr and then centrifuged at 5,000 rpm for 15 min. 
Pellets were resuspended in lysis buffer containing 50 mM Tris (pH 8.0), 
300 mM NaCI, 10 mM imidazole, 10% glycerol, and 5 mM (3-mercaptoethanol 
and frozen at -80°C until further use. Cells were lysed using a Microfluidizer 
system (Microfluidics, USA). The lysate was applied onto a Ni-NTA column 
for affinity purification. SUMO-tagged proteins were cleaved by Ulpl-his 
protease (Rood et al., 2012). Proteins were further purified by size-exclusion 



and anion-exchange chromatography using Sephacryl 200 16/60 and MonoQ 
5/50 columns. CIpX and CIpP were purified as described previously (Chien 
et al., 2007). Detailed purification protocols are available upon request. 

Synchronization and In Vivo Protein Stabiiity Assays 

Synchronization experiments were performed by growing Catv/obacfe/' strains 
to an ODeoo of 0.3-0. 5 in PYE or M2G media with appropriate antibiotic and 
xylose when required, as indicated in the figure legends. Swarmer cells were 
isolated using Percoll density gradient centrifugation. The isolated swarmer 
cells were then released into fresh PYE or M2G minimal media for progression 
through the cell cycle. In vivo protein stability assays were performed by 
growing WT or Caulobacter cells, expressing different constructs from a 
xylose-inducible plasmid, to an ODeoo of ^0-3 in PYE medium containing 
25 |ig/ml spectinomycin. Protein expression was induced with 0.003% xylose 
for 1 hr wherever indicated in figure legends, and then protein synthesis was 
blocked by addition of 30 |ag/ml chloramphenicol. 

Microscopy 

Phase-contrast microscopy was performed on glass slides layered with a 1 % 
agarose pad. A Zeiss Scope A.1 microscope (Zeiss, Germany) equipped with 
lOOx (1 X 25 oil ph3 oo/0. 17) objective and 60 N-C 1” lOOx camera was used. 
Images were analyzed with /\xiovision and Imaged (NIH, USA) software. 

Gel and Fluorescence-Based Degradation Assays 

Degradation of proteins was monitored using SDS-PAGE gels as described 
previously (Bhat et al., 2013). The concentrations of different proteins used 
in degradation reactions are indicated in the figure legends. Degradation of 
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GFP-ssrA and GFP-ssrA-SS were monitored with the ioss of fluorescence over 
time as described previously (Smith et al., 2014). 

Western Blot Analysis 

Cell cultures withdrawn at indicated time points were spun down, resus- 
pended in 2x SDS sample buffer, boiled at 95°C for 10 min, and then 
centrifuged. After centrifugation, the clarified supernatant was loaded onto 
SDS-PAGE gels. Proteins were then transferred to a polyvinylidene difluoride 
(PVDF) membrane at 20V for 1 hr and probed with polyclonal rabbit anti- 
CtrA (1:5000 dilution), anti-McpA (1:10000 dilution), anti-TacA (1:10000 dilu- 
tion), anti-CC3144 (1:5000 dilution), anti-CIpP (1:5000 dilution), anti-MreB 
(1:10000 dilution), or monoclonal mouse anti-FI_AG M2 (1:5000 dilution; 
Sigma, USA) antibodies. Proteins were visualized with either goat anti-rabbit 
(Millipore, USA) or goat anti-mouse (Millipore, USA) secondary antibodies 
conjugated to HRP using a chemiluminescence detection system (G:box 
Chemi XT4, Syngene, UK). 

Analytical Size-Exclusion Chromatography 

Analytical size-exclusion chromatography was performed to detect interaction 
between proteins using a Superdex 200 5/150 column (GE Healthcare, USA). 
The column was equilibrated with H-buffer (20 mM HEPES-KOH, pH 7.5, 
100 mM KCI, 10 mM MgCl 2 , 10% glycerol, 5 mM |3-mercaptoethanol) and 
standardized using thyroglobulin, y-globulin, ovalbumin, myoglobulin, and 
vitaminBi 2 as protein standards (Biorad, USA). Purified RcdA or RcdAAC 
either alone or in combination with TacA or TacA variants was loaded onto 
the column. The elution profile was monitored by measuring absorbance at 
280 nm. 

In Vitro and In Vivo Puii-Down Assays 

In vitro pull-down assays were performed by incubating purified hise-RcdA 
(10 ^lM) or hiSe-RcdAAC (10 ^lM) or TacA (312-488) (5 i^M) or PdeA (5 i^M) 
either alone or together with 50 |al pre-equilibrated Ni-NTA resin (Thermo- 
Scientific, USA) at 4°C for 1 hr. The resin was washed twice with H-buffer 
supplemented with 20 mM imidazole. Bound complex was eluted with 
H-buffer containing 200 mM Imidazole. Eluted proteins were analyzed by 
SDS-PAGE gels. 

In vivo pull-down assays were performed by loading lysates (containing 
equal amount of protein) from cells expressing either M2 peptide, M2-RcdA, 
or M2-RcdAAC onto a pre-equilibrated anti-FI_AG M2 affinity resin packed col- 
umn (Sigma, USA). The buffer used for equilibration was 50 mM Tris (pH 7.5) 
and 1 50 mM NaCI (TBS). The resin was washed twice with TBS, and the bound 
proteins were eluted with 0.1 M glycine-HCI (pH 3.5) buffer. Eluted proteins 
were analyzed by silver staining and identified by tandem mass spectrometry 
(see Supplemental Experimental Procedures). 

Fluorescence An isotropy- Based Assay 

Fluorescence anisotropy experiments were performed using a fluorescein 
isothiocynate (FITC) labeled, 19-residue C-terminal peptide of RcdA (FITC- 
EAPRPVQNQLDRLTAAFGG, LifeTein, USA). As indicated in Figure 3G, the 
peptide was incubated with proteins in H-buffer. Anisotropy was monitored 
at 30°C with a Spectramax M5 (Molecular Devices) plate reader with excitation 
and emission wavelengths set at 488 nm and 525 nm, respectively. 

SUPPLEMENTAL INFORMATION 

Supplemental Information includes Supplemental Experimental Procedures, 
five figures, and two tables and can be found with this article online at http:// 
dx.doi.org/1 0. 1 01 6/j.cell.201 5.09.030. 
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SUMMARY 

Most short-lived eukaryotic proteins are degraded 
by the proteasome. A proteolytic core particle (CP) 
capped by regulatory particles (RPs) constitutes 
the 26S proteasome complex. RP biogenesis culmi- 
nates with the joining of two large subcomplexes, 
the lid and base. In yeast and mammals, the lid 
appears to assemble completely before attaching 
to the base, but how this hierarchical assembly is 
enforced has remained unclear. Using biochemical 
reconstitutions, quantitative cross-linking/mass spec- 
trometry, and electron microscopy, we resolve the 
mechanistic basis for the linkage between lid bio- 
genesis and lid-base joining. Assimilation of the final 
lid subunit, Rpn12, triggers a large-scale conforma- 
tional remodeling of the nascent lid that drives 
RP assembly, in part by relieving steric clash with 
the base. Surprisingly, this remodeling is triggered 
by a single Rpn12 a helix. Such assembly-coupled 
conformational switching is reminiscent of viral 
particle maturation and may represent a commonly 
used mechanism to enforce hierarchical assembly in 
multisubunit complexes. 

INTRODUCTION 

The eukaryotic proteome is constantly remodeled to control cell 
function. This remodeling depends heavily on protein degrada- 
tion by the ubiquitin-proteasome system (UPS) (Finley, 2009; 
Ravid and Hochstrasser, 2008). The UPS consists of a cascade 
of enzymes that catalyze the transfer of the protein ubiquitin to 
protein substrates destined for degradation by the 26S protea- 
some. The proteasome is a 2.5 MDa ATP-dependent protease 
complex composed of a barrel-shaped proteolytic core particle 
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capped on its open ends by 1 9S regulatory particles (Tomko and 
Hochstrasser, 2013). Under certain conditions, the regulatory 
particles (RP) can be divided into subcomplexes called the lid 
and base (Glickman et al., 1998). The lid consists of nine sub- 
units, Rpn3, 5-9, 11, 12, and Semi. The base contains a ring 
of six AAA+ family ATPases, Rpt1-6, and the non-ATPase 
subunits Rpnl, 2, 10, and 13. The lid removes the ubiquitin tag 
from substrates, whereas the base uses mechanical energy 
derived from ATP hydrolysis to unfold substrates and insert 
them into the core particle (CP) for destruction (Nyquist and Mar- 
tin, 2014; Tomko and Hochstrasser, 2013). 

The proteasome consists of at least 33 distinct subunits. Its 
assembly is a tightly coordinated process that relies both on 
dedicated extrinsic assembly chaperones and intrinsic features 
of the subunits themselves. Whereas both the CP and RP base 
depend heavily on assembly chaperones for efficient assembly, 
the lid can assemble independently of any additional eukaryotic 
factors (Fukunaga et al., 2010; Tomko and Hochstrasser, 201 1 , 
2014). Lid biogenesis appears to follow a defined assembly 
sequence that culminates with addition of the Rpnl 2 subunit 
to a nearly complete lid intermediate consisting of Rpn3, 5-9, 
11, and Semi, called lid particle 2 (LP2) (Tomko and Hoch- 
strasser, 2011). Recombinant lid forms efficiently in the absence 
of the base or CP (Lander et al., 201 2; Tomko and Hochstrasser, 
2014). Importantly, LP2 is unable to participate in further 26S 
proteasome assembly unless Rpnl 2 is present to complete lid 
formation (Tomko and Hochstrasser, 2011). 

Incorporation of Rpnl 2 into LP2 licenses the resultant com- 
plex for assembly into full proteasomes, but the molecular mech- 
anism underlying this critical function for Rpnl 2 has remained 
obscure (Tomko and Hochstrasser, 2011). In the mature 26S 
proteasome, Rpnl 2 occupies a peripheral position within the 
RP (Figures 1 A and 1 B) and contributes minimally to the interface 
between lid and base (Matyskiela et al., 2013; Unverdorben 
et al., 201 4). Despite this, the LP2 intermediate has no detectable 
affinity for the base in the absence of the latter subunit (Tomko 
and Hochstrasser, 2011). 
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Figure 1. Reconstitution of the 19S Regulatory Particle Using Purified Components 

(A) Pseudoatomic model of the proteasome regulatory particle (RP) (from PDB 4CR2) indicating the position of Rpn12 (red) with respect to (yellow) and base (blue) 
subunits. For clarity, the CP, as well as the RP subunits Rpn1 and Rpn10, is omitted. 

(B) The Rpn12-Glu271 residue (shown in cyan) lies near the lid-base interface. CP, Rpn1, and Rpn10 have been omitted as in (A). 

(C) RP assembly depends on Rpn12. The indicated recombinant components (1 ^iM) were incubated together in the presence of 1 mM ATP and 10 ^iM re- 
combinant Rpnl 0 for 20 min at 30°C before separation by native PAGE. Gels were either stained with Coomassie brilliant blue or transferred to PVDF membranes 
followed by immunoblotting with antibodies to FI_AG (on the Rpti base subunit), HA (on the Rpn7 LP2 subunit), or Rpnl 2. 

(D) The rpn12-E271K mutation weakens lid-base interaction without interfering with lid formation. The indicated components (1 laM) were incubated with 1 mM 
ATP and 10 |iM RpnIO before analysis as in (C). 

See also Figure SI . 
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Proteasomes have recently emerged as important targets for 
the treatment of certain cancers (Crawford and Irvine, 2013). 
All of the anti-proteasome drugs currently FDA approved or in 
clinical trials are inhibitors of the CP active sites (Dou and 
Zonder, 2014), but drug resistance is already a significant prob- 
lem (Cheriyath et al., 2007; Richardson et al., 2003). Interference 
with proteasome assembly could provide a valuable alternative 
strategy for chemotherapy. As incorporation of Rpn12 is obliga- 
tory for subsequent RP assembly, interference with this final step 
of lid assembly would be an attractive target for such therapies. 
However, this will require more detailed knowledge of lid bio- 
genesis and a better understanding of how Rpn12 regulates 
RP assembly. Thus, a central challenge in studying proteasome 
biogenesis, as well as that of many other multisubunit com- 
plexes, is to understand precisely how hierarchical assembly 
events are enforced in vivo. 

Single-particle electron microscopy (EM) and mass spectrom- 
etry (MS) are powerful tools to investigate large macromolecular 
structures. Although these approaches have been used exten- 
sively with the fully assembled 26S proteasome (Lander et al., 
2012; Sharon et al., 2006, 2007; Unverdorben et al., 2014; Witt 
et al., 2006), little information is available on the structures of 
proteasome assembly intermediates (Kock et al., 2015). Appli- 
cation of structural methods to assembly intermediates could 
provide substantial insight into the molecular mechanisms guid- 
ing proteasome biogenesis. 

Quantitative cross-linking/MS (QCLMS) has recently emerged 
as a means to provide insights into dynamic aspects of protein 
structures (Fischer et al., 2013). An early application revealed 
structural changes within a multi-protein complex upon phosphor- 
ylation (Schmidt et al., 2013). Here, using a newly established 
workflow, we apply QCLMS to both the LP2 assembly intermedi- 
ate and full lid; this has revealed multiple conformational changes, 
including the repositioning of the Rpn8-Rpn11 deubiquitinating 
enzyme (DUB) module. In addition, by comparing EM-derived mo- 
lecular models of LP2, the free lid and the lid docked within the 26S 
proteasome, we show that incorporation of Rpnl 2 results in large- 
scale conformational remodeling of the nascent lid from a 
compact “closed” state in LP2, which is autoinhibited for base 
binding, to an open, more flexible state that can pair with the 
base. Remarkably, the highly conserved C-terminal a helix of 
Rpnl 2 is sufficient to promote conversion of LP2 to a lid-like 
conformation, enabling assembly of LP2 into the RP. These find- 
ings rationalize previous biochemical and genetic data indicating 
a critical role for Rpnl 2 in RP assembly, and provide novel insights 
into chaperone-independent proteasome assembly. Coupling 
small, local structural switches to large conformational changes 
is reminiscent of certain steps in virus assembly. Such locally 
induced conformational rearrangements are likely to be used in 
the hierarchical assembly of other multisubunit complexes, such 
as snRNPs, group II chaperonins, and ribosomes. 

RESULTS 

Reconstitution of Proteasome RP Assembly from 
Recombinant Constituents 

Previously, we found that purified yeast LP2 assembled into 
full proteasomes when added to a yeast extract containing 



wild-type Rpnl 2, but not one with an Rpnl 2 truncation lacking 
C-terminal residues 212-274 (Figure SI) (Tomko and Hoch- 
strasser, 201 1 ). This could be due to a requirement for full-length 
Rpnl 2 or from an inhibitory effect of the truncated Rpnl 2 pro- 
tein. Assembly might also require other factors in the extracts 
such as molecular chaperones. Thus, we sought a system of 
reduced complexity to analyze RP assembly. Toward this end, 
we established an in vitro RP assembly assay that consisted 
exclusively of bacterially expressed recombinant proteins. 

Interaction of recombinant LP2 and a recombinant base 
precursor containing the Rpnl 4, Nas6, and Hsm3 assembly chap- 
erones (hereafter called base) (Beckwith et al., 2013) was entirely 
dependent on Rpnl 2 (Figure 1C). Notably, Rpnl 2 could be sup- 
plied as part of the lid or added ectopically to the LP2 and base 
mixture with comparable results. In the pseudoatomic structure 
of the 26S proteasome determined by cryo-EM (PDB 4CR2), 
Rpnl 2 Glu271 lies nearthe interface between the lid and base (Fig- 
ure 1 B). As previously observed in yeast cells (Tomko and Hoch- 
strasser, 2011), mutation of this highly conserved residue to lysine 
also impairs lid-base attachment in the in vitro system, but it does 
not compromise the stability of Rpnl 2 within the lid (Figure ID). 
These findings support our previous observations that Rpnl 2 
incorporation controls lid-base association in yeast and is regu- 
lated at least in part by the C-terminal helix of Rpnl 2. 

QCLMS Reveals Local LP2-Lid Structural Changes 

Rpnl 2 occupies a peripheral position within the RP and provides 
a very small fraction of the total interface between the lid 
and base complexes. This suggests that Rpnl 2 may instead 
promote lid-base joining primarily by inducing conformational 
changes in the assembling lid. To help determine how Rpnl 2 
mediates lid-base joining, we first utilized QCLMS (Fischer 
et al., 201 3) to identify protein cross-links that were unique or en- 
riched in LP2 or lid, with the expectation that these differences 
reflect differences between the solution state conformations of 
these two particles. We purified each of the complexes from 
yeast (Figures S2A and S2B) and subjected them to cross-linking 
with non-deuterated or deuterated forms of the amine-reactive 
cross-linker BS^, followed by protease digestion, 1:1 sample 
mixing, and LC-MS/MS analysis (Figure 2A). We used conditions 
identified in preliminary experiments that yielded only intra- 
complex cross-links (Figures S2C and S2D). The cross-linking 
experiments were done twice but with the non-deuterated and 
deuterated BS^ swapped between LP2 and lid samples; this 
replicate analysis with label-swap controlled for any labeling 
bias and ensured accurate quantitation, especially for cross- 
links that are unique in either lid or LP2 (Figures S2E-S2G). 

We identified and quantified 1 22 unique cross-links (Table SI ). 
Among them, 81 were quantified consistently in label-swap rep- 
licas. Most of these 81 cross-links showed nearly equal preva- 
lence in the lid and LP2 samples (i.e., were centered on log 2 = 0; 
Figure 2B, i, and Figures S2E and S2F), which is consistent 
with 1:1 sample mixing (Figure 2B). However, one cross-link 
was unique to lid, and three were unique to LP2 (Figure 2B, ii). 
In addition, 16 of the 77 cross-links observed for both lid and 
LP2 were significantly enriched in either lid or LP2 (“Significance 
A” test, Perseus version 1 .4.1 .2 [Cox and Mann, 2008]; p < 0.05). 
Of these 16 cross-links, 15 changed more than 4-fold in at least 
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one of the replicated analyses (Figure S2F). Importantly, none 
of these cross-links directly involved Rpn12 or were in positions 
expected to be sterically occluded by Rpn1 2 based on the pseu- 
doatomic structure of the 26S proteasome, strongly suggesting 
that these differences resulted from structural variance between 
the two particles. 

Examination of the cross-links enriched in either LP2 or the 
lid suggested two major conformational differences between 
the two particles. The first was the relative positioning of the 
Rpn8-Rpn11 module (regions A1 [Rpn8 residues 299-309], A2 
[Rpn11 residue 267], and B [Rpn7 residue 2], Figures 2C and 
2D) within the complexes, and the second was the positioning 
of the Sem1 N terminus, the N-terminal a-helical domain of 
Rpn3, and the C-terminal a-helix of Rpn7 relative to one another 
(regions C1 [Sem1 residues 2-26], C2 [Rpn7 residues 416-426], 
and C3 [Rpn3 residues 299-389], Figure 2C). The latter three 
subunits yielded a web of cross-links that was selectively en- 
riched in LP2, suggesting that these regions were more closely 
packed in LP2 (pink lines between regions Cl, 2, and 3, Fig- 
ure 2C). The enrichment of cross-links between the C-terminal 
helix of Rpn7 and residues in the N-terminal extension of 
Rpn3 (Figure SI) suggests that the helical bundle, or at least 
the C-helix of Rpn7, may be compacted against the body of 
Rpn3 and/or Rpn7 in LP2, and that it is repositioned upon incor- 
poration of Rpn12. As Rpn12 forms an extensive interface with 
Rpn3 and appears to make little to no contact with Rpn7 in the 
26S proteasome structure (Matyskiela et al., 2013; Unverdorben 
et al., 2014), it is likely that these changes are mediated through 
Rpn3. 

Regarding the Rpn8-Rpn11 module, cross-links between 
Rpn8 and the C-terminal helix of Rpn9 (Figure SI) were enriched 
or unique to LP2 (Figure 2C), and an LP2-specific cross-link be- 
tween Rpn8 and the extreme N terminus of Rpn7 was observed 
(red line between Al and B, Figure 2C). In contrast, we observed 
lid-enriched cross-linking between the C terminus of Rpn8 
and the C-terminal helix of Rpn7, the N-terminal region of 
Rpn3, and the N-terminal region of Semi (regions Cl and C3, 
light blue lines. Figure 2C). Further, a cross-link between the 
C-terminal helix of Rpnl 1 and the N-terminal region of Semi 
was detected only in the lid (regions A2 and Cl , dark blue line. 
Figure 2C). A series of common cross-links between Rpn8 and 
Rpnl 1 suggests that the module itself is unlikely to vary substan- 
tially in structure between LP2 and lid. 



Taken together, the enrichment of cross-linking in different 
regions of the lid and the loss or depletion of cross-linking to 
Rpn9 suggest that the Rpn8-Rpn11 module undergoes sub- 
stantial movement upon incorporation of Rpnl 2 (Figure 2D). 
We suggest that a rigid-body rotation of the Rpn8-Rpn1 1 dimer 
(modeled in Figure 2E) may occur upon incorporation of Rpnl 2 
that would account for the observed changes in cross-linking. 

The N Terminus of Rpn5 Is Highly Flexible in the 
Undocked Lid 

Currently, the only structural information on the isolated lid is 
from negative-stain EM (Lander etal., 2012). However, this struc- 
ture is not at sufficiently high resolution to aid in visualizing our 
cross-linking data. To provide insight into the solution confor- 
mation of the free lid, we mapped our cross-link network for 
the isolated lid onto the lid within the cryo-EM structure of the 
26S proteasome (Unverdorben et al., 2014). 61 cross-links could 
be mapped onto this structure (Figure 3, Data SI , and Table SI). 
Surprisingly, 26 of the 61 cross-links spanned distances greater 
than the calculated maximum distance between a-carbons in 
BS^-cross-linked residues (based on spacer length of BS^ and 
side-chain lengths of cross-linked residues; Supplemental 
Experimental Procedures) (Figure 3B), suggesting that the free 
lid is much more flexible than the docked lid. Two clusters of 
over-length inter-subunit cross-links were apparent. The first 
consisted of five cross-links between Rpn9 and either Rpn8 or 
Rpnil, indicating that the Rpn8/Rpn11 dimer is positioned 
closer to Rpn9 in the free lid than in the mature proteasome. 
Importantly, together with the results of our LP2-lid comparison, 
this further supports a model in which Rpn8/Rpn1 1 undergoes 
substantial movements during RP assembly. 

The second cluster involved 19 cross-links between the 
N-terminal extension of Rpn5 and either itself or other lid sub- 
units, with several of these cross-linked residues being over 
1 00 A apart in the 26S proteasome. Further, cross-links involving 
Rpn5-K108 were observed with residues from four subunits 
(Rpn3, Rpn6, Rpn8, and Rpnil) that are, in some cases, very 
far from one another in the proteasome-docked lid, suggesting 
substantial mobility of this domain of Rpn5 in the free lid. This 
was further supported by the observation that over-length 
intra-subunit cross-links were observed only within Rpn5 and 
not within any other subunit. Taken together, these data strongly 
suggest that the lid Rpn5 N-terminal domain is highly flexible 



Figure 2. Quantitative Cross-iinking/Mass Spectrometry Reveals Local Differences in LP2 and Lid Structures 

(A) Overview of quantitative cross-iinking/MS anaiysis of LP2 and iid compiexes. 

(B) /: foid changes [iog2(LP2/iid)] of 81 quantified cross-iinks (consistentiy quantified in both forward- and reverse-iabeied experiments) were piotted against 
their signai intensities. These quantified cross-iinks were divided into five subgroups and coiored accordingiy— three are unique to the LP2 sampie (red), one is 
unique to the iid sampie (biue); among 77 cross-iinks observed in both sampies, 61 common cross-iinks showed no significant difference in yieids in the LP2 and 
the iid sampies (gray), whiie seven cross-iinks are significantiy upreguiated in the LP2 sampie (p < 0.05, magenta) and nine in the iid sampie (p < 0.05, cyan). //: 
observed MSsignais of supporting cross-iinked peptides of “iid unique” cross-iink267^'^^''''-20^^'^^ (ieft), “common” cross-iink 404^'^'^^-150^'^'^’'’' (middie), and 
“LP2 unique” cross-iink 2f^PN7_3QQRPNs 

(C) Cross-iinking network of iid/LP2 subunits. Proteins are shown as bars, and cross-iinks are shown as iines that are coiored according to their quantified 
subgroups (described in B). LP2-unique, LP2-enriched, iid-unique, and iid-enriched cross-iinks, which suggest conformationai differences between LP2 and iid, 
mainiy invoive three regions marked as A, B and C. 

(D) Reiative cross-iink enrichments suggest a repositioning of the Rpn8-Rpn11 heterodimer within the iid structure (derived from PDB 4CR2) upon Rpnl 2 
incorporation that may be important for subsequent RP assembiy steps. Regions A, B, and C are highiighted accordingiy. Regions Cl and B that are not present 
in 4CR2 are iocated based on their cross-iinking partners in the modei. 

(E) A rigid-body rotation of the Rpn8-Rpn1 1 upon Rpnl 2 incorporation may account for the observed change in the Rpn8-Rpn1 1 cross-iink network. 
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Figure 3. Mapping Lid Cross-links onto the Lid in the 26S Pseudoatomic Model Reveals Subunit Flexibility in the Undocked Lid 

(A) Lid cross-links (61) detected in our QCLMS analysis are depicted on the 26S proteasome pseudoatomic structure (PDB 4CR2). Cross-links are displayed as 
straight lines connecting cross-linked residues, and distances between them are measured between Ca atoms. Cross-links are displayed only when both cross- 
linked residues are present in the proteasome pseudoatomic structure. Cross-links exceeding the predicted limit are shown in red, whereas those within the limit 
are shown in blue. 

(B) The differences in length between the 61 cross-links mapped onto PDB 4CR2 in (A) and the theoretical cross-linking limit (e.g., 27 A for lysine-lysine cross- 
links, Supplemental Experimental Procedures) are shown. Distances are between the Ca atoms of cross-linked residues. 

See also Figure S2. 



and free to undergo substantial movement in solution (Lander 
et al., 2012). Incorporation of the lid into the 26S proteasome 
likely stabilizes Rpn5 in the extended conformation via interac- 
tion of its N terminus with the base and CP and is accompanied 
by a repositioning of the Rpn8/Rpn1 1 module away from Rpn9. 

EM Structure of LP2 Reveals a Compact Intermediate 

We next investigated the architecture of LP2 by negative-stain 
EM and single-particle analysis for comparison to the negative 
stain EM structure of the isolated lid (EMD-1993) (Lander et al., 
2012). Raw micrographs of LP2 showed monodisperse particles 
with dimensions ~200 A x 250 A (Figure S3A). Reference-free 
two-dimensional (2D) alignment and classification yielded class 
averages with strong similarities to the previously determined 
structure of the isolated lid (Figure 4A). As expected, LP2 lacked 
density for Rpn12, but surprisingly, also lacked the prominent 
N-terminal extension of Rpn6 that was previously observed in 
the isolated lid (Lander et al., 2012). We saw this absence of 
density for the N-terminal portion of Rpn6 both for LP2 purified 



from yeast, as well as recombinant LP2 produced in E coli 
(Figures S3G and S3H). 

To address whether the observed LP2-lid differences were a 
result of sample preparation differences, we repeated the EM 
analysis with the full lid. 2D class averages of lid purified from 
yeast showed a density for the N-terminal extension of Rpn6 
(Figure 4A). Additionally, a three-dimensional (3D) reconstruction 
of lid obtained with our complexes was nearly identical to that 
determined by Lander et al. (2012) (Figures 4B, S3C, and S3D), 
indicating that sample preparation was not responsible for the 
structural differences between LP2 and lid. Using iterative 
back-projection refinement of LP2 particles with the structure 
of lid low-pass filtered to 60 A as a starting model, we obtained 
a 3D reconstruction of LP2 at 16 A resolution, showing the clear 
domain architecture of the complex (Figures 4B and S3E-S3G). 

The quality of our LP2 structure allowed facile automated 
segmentation based on visual inspection and comparison to 
the previously determined lid density (Figure 4B). In agreement 
with our 2D analysis and protein cross-linking, the overall subunit 
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Figure 4. Electron Microscopy of LP2 and 
Lid Purified from Yeast Reveals Closed and 
Open Configurations, Respectively 

(A) The N-terminal domain of Rpn6 adopts an 
extended conformation in iid, but not in LP2. 
Representative 2D ciass averages of LP2 or iid 
indicating the positions of Rpn12 and the Rpn6 N 
terminus are shown. The width of the boxes is 
-430 A. 

(B) 3D reconstructions of the iid (EMD-1993) and 
LP2 (this study) are shown. 

(C) Difference-density map of iid and LP2 indicates 
that the two major regions of variant density are 
those for Rpn12 and the N-terminai domain of 
Rpn6. 

See aiso Figure S3. 
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architectures of LP2 and lid were very similar. In LP2, the Rpn6 
N-terminal helical domain (Rpn6-N) is folded into a cavity formed 
by Rpn3, Rpn7, Rpn8, and Rpn1 1 . Using 3D difference mapping 
between our LP2 structure and the lid structure from Lander et al. 
(2012), we unambiguously assigned the most prominent 
changes in density in the lid to Rpn12 and Rpn6-N (Figure 4C). 
This observation prompted us to reexamine our QCLMS data. 
Although the Rpn6-N conformation seen by EM in LP2 was not 
overtly reflected in LP2-specific cross-linking of this domain (Fig- 
ure 3), an LP2-unique cross-link was in fact observed between 
Rpn6-Lys1 95 and Rpnl 1 -Lysl 2 (Table SI ). This particular amino 
acid of Rpnl 1 is not modeled in the cryo-EM structure (PDB: 
4CR2), but extrapolation from the most proximal modeled 
Rpnil residue (Thr23) indicates that the cross-link between 
Rpn6 and Rpnl 1 must be well over the maximal predicted cutoff. 
In further support of the compact conformation of LP2 observed 
by EM, two overlength cross-links were observed between 
Rpn6-N and Rpn8 (Rpn6-Lys300 to Rpn8-Thr224 and Rpn6- 
Ser310 to Rpn8-Lys198) for both the lid and LP2 (Figure 3A 
and Table SI). Formation of these cross-links in the lid, as well 
as LP2, suggests that the lid may dynamically sample more 
compact conformations. 

LP2 Is Sterically Incompatible with Positioning in the 
Mature 26S Proteasome 

In the 26S proteasome, the Rpn6 N-terminal helical domain is 
extended and makes contact with subunits of the RP base and 
CP (Lander et al., 2012; Pathare et al., 2012). Since this domain 
is folded into the core of LP2, it raised the possibility that 
adoption of this conformation produced a steric conflict with 
the base that would prevent incorporation of LP2 into the 
assembling RP prior to Rpnl 2 assimilation. By docking the 
previously determined EM structure of the isolated lid onto 
the lid subunits of the pseudoatomic model of the 26S and 
then overlaying our LP2 structure onto this structure, we were 
able to examine whether LP2 could incorporate into the protea- 



some without steric conflict (Figure 5A). In 
the EM structure of the 26S proteasome, 
Rpn3 and Rpn9 curve inward toward the 
center of the RP and make direct contacts 
with Rpn2 and RpnIO, respectively. In 
the LP2 model, Rpn3 and Rpn9 are not curved inward, which 
could contribute to the apparent poor affinity of LP2 for 
the base. Further, in LP2, Rpn6-N cannot make the contacts 
with the core particle and base ATPase ring that are observed 
for this domain in the mature 26S proteasome. Rather, the 
model displayed clear steric clashes between the LP2 Rpn6-N 
conformation and segments within the base ATPase subunits 
Rpt3, Rpt4, and Rpt6 (Figure 5A, inset). This strongly implies 
that the closed conformation observed for LP2 is an autoinhi- 
bited conformation that prevents lid-base association until 
Rpnl 2 incorporation. This model would rationalize the behavior 
of LP2 in vitro (Figure 1) and in yeast (Tomko and Hochstrasser, 
2011). 

Addition of Purified Rpnl 2 to LP2 Is Sufficient to Drive 
Lid Maturation 

If Rpnl 2 relieves an autoinhibited conformation of LP2 to 
promote lid-base joining, then adding purified Rpnl 2 to LP2 
should promote structural changes in LP2 that result in a lid- 
like structure. We treated purified recombinant LP2 with 
an equimolar concentration of recombinant Rpnl 2 and exam- 
ined the structure of the resultant complex by EM. Consistent 
with a role for Rpnl 2 in maturation of the assembling lid, 
2D class averages indicated that Rpnl 2 addition caused exten- 
sion of Rpn6-N to a conformation similar to that observed in 
the purified full lid (Figure 5B). Thus, incorporation of Rpnl 2 
drives a large-scale structural rearrangement of the assembling 
lid that relieves the autoinhibitory conformation of the Rpn6 N 
terminus. 

Using two complementary approaches, we next addressed 
whether relieving the steric conflict between the N-terminal 
domain of Rpn6 and the base was sufficient to drive RP assem- 
bly in the absence of Rpnl 2. In the first, we appended the bulky 
GFP protein to the N terminus of Rpn6, which should physically 
clash with surrounding subunits in the closed conformation and 
potentially induce the extended conformation for Rpn6-N. In the 
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Figure 5. Rpn12 Addition Relieves an Autoinhibitory Conformation 
of the Rpn6 N Terminus 

(A) The LP2 3D reconstruction (transparent surface) was modeled onto the 
pseudoatomic model of the 26S proteasome (PDB 4CR2) using the winged- 
helix horseshoe as a guide point, revealing steric clash (red surface) between 
the N-terminal domain of Rpn6 in LP2 (medium blue surface) and Rpt3, 4, and 
6 (dark blue ribbons) of the base (inset). The corresponding position of the 
N-terminal domain of Rpn6 in the lid is also indicated (light blue ribbons). Rpn3 
(yellow) and Rpn9 (magenta) are highlighted to show differences between 
LP2 and the docked lid. 

(B) Addition of recombinant Rpn12 to purified recombinant LP2 triggers 
extension of the Rpn6 N terminus. The class averages are labeled as in Figure 
4A. The width of the boxes is --430 A. 

second approach, we engineered N-terminally truncated forms 
of Rpn6 that were anticipated to lack the putative autoinhibitory 
domain (Figure 6A). Both the GFP-Rpn6 fusion and N-terminal 



Rpn6 truncation proteins assembled efficiently into LP2 when 
coexpressed with the other LP2 subunits in E. coli (Figures 
S4A and S4B), suggesting that Rpn6-N is dispensable for 
LP2 assembly. As predicted, appending GFP to the N terminus 
of Rpn6 resulted in a conformation of Rpn6-N within LP2 that 
appeared similar to its extended conformation in the full lid 
(Figures 6B and S4C). Importantly, each of the mutant forms of 
LP2 assembled into the RP when recombinant Rpn12 was pro- 
vided (Figure S4D), indicating that they were functional for RP 
assembly. In contrast, neither GFP-LP2 (Figure 6C) nor LP2 
harboring truncations of Rpn6 (Figure 6D) assembled into an 
RP-like structure in the absence of Rpn1 2. Therefore, reposition- 
ing or removal of Rpn6-N is insufficient to promote LP2-base 
joining. This indicates that Rpn12 binding to LP2 triggers addi- 
tional changes in the lid precursor that are necessary for RP 
formation. 

Rpn12 C-Terminal Helix Engagement of the Lid Helical 
Bundle Licenses RP Assembly 

The C terminus of Rpn12 forms an a helix that inserts into a 
pronounced groove within a helical bundle formed by C-terminal 
helices from all the other large lid subunits (Estrin et al., 2013). 
This helix of Rpn12 is important both for the completion of lid 
assembly and engagement of the lid with the base (Tomko and 
Flochstrasser, 2011). The complex interface between the helical 
bundle and the Rpn12 helix would in principle allow Rpn12 to 
sense and transmit signals through multiple lid subunits. We 
therefore asked whether provision of the Rpn12 C-terminal helix 
alone was sufficient to drive lid-base association. 

A fluorescently labeled peptide composed of Rpn12 residues 
254-272 (N12pep) bound to purified LP2 in vitro and could be 
competed off with excess recombinant Rpn12, indicating that 
the peptide bound LP2 in a manner similar to the full-length 
protein (Figure 7A). Indeed, addition of N12pep to purified LP2 
triggered conformational extension of the Rpn6 N-terminal 
domain (Figure 7B), as had been observed with full-length re- 
combinant Rpn12 (Figure 5B). Most importantly, N12pep pro- 
moted LP2-base joining when LP2 and N12pep were supplied 
in molar excess to the base, as revealed by a migration shift of 
the base on native gel blots (Figure 7C). Assembly into the RP 
was not as efficient as it was with recombinant Rpn12, suggest- 
ing that other parts of Rpnl 2 promote assembly, possibly by sta- 
bilizing the Rpnl 2 C-terminal helix. RP formation was specific to 
N12pep, as identical concentrations of two unrelated peptides 
had no effect on LP2-base joining (Figure 7D). In summary, our 
data indicate that engagement of the nascent lid helical bundle 
by the Rpnl 2 C-terminal helix is the key event triggering con- 
formational remodeling of the assembling lid and subsequent 
lid-base joining. 

DISCUSSION 

Flierarchical assembly has emerged as a theme in the regula- 
tion of proteasome biogenesis (Tomko and Hochstrasser, 
2013). In yeast and mammals, most evidence suggests that 
the proteasome lid assembles completely before making 
stable contact with the base complex or its subunits (Fukunaga 
et al., 2010; Thompson et al., 2009; Tomko and Hochstrasser, 
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2011). The mechanism underlying this observation had been 
unknown. Our combined QCLMS, EM, and biochemical recon- 
stitution studies now show that both large-scale conformational 
rearrangements and local repositioning of lid subunits are trig- 
gered by Rpn12 binding. These rearrangements underlie the 
coupling between completion of lid assembly and licensing of 
the lid for attachment to the base. Our findings rationalize pre- 
vious genetic data implicating the Rpn12 C-terminal helix in RP 
formation and reveal how Rpn12 can govern lid-base joining 
despite making minimal contacts with the base in the mature 
proteasome. 

Local and Global Conformational Changes during 
Lid Assembly 

Mapping the quantified cross-links onto the 26S proteasome 
pseudoatomic structure reveals that the Rpn8/Rpn11 module 
is repositioned as assembly progresses from LP2 to lid and 
finally to the full proteasome (Figures 2D and 3). Recently, it 
has been shown that the isolated recombinant Rpn8/Rpn11 
heterodimer has significant deubiquitinating (DUB) activity 
(Worden et al., 2014). Intriguingly, we did not detect DUB activ- 
ity in LP2 (Figure S4E and data not shown) or in the full lid (data 
not shown). It is unclear whether the separate Rpn8-Rpn11 
module occurs physiologically either as a lid assembly interme- 



Figure 6. Relief of Steric Conflict Imposed 
by the Rpn6 N Terminus Is Not Sufficient to 
Drive Binding of LP2 to the Base 

(A) The positions of truncations to Rpn6 are shown 
on the pseudoatomic structure of the iid from PDB 
4CR2. The base and core particie subunits are 
omitted for ciarity. The positions of Rpn6 amino 
acid 173 and Rpn6 amino acid 212 (the first 
endogenous residues present in the A172 and 
A211-rpn6 truncation mutants, respectiveiy) are 
shown as yeiiow spheres. 

(B) Fusion of GFP to the N terminus of Rpn6 within 
LP2 causes the N terminus to assume a confor- 
mation simiiar to that observed for the iid. The 
width of the box is ^430 A. 

(C) Addition of LP2 containing a GFP-Rpn6 fusion 
to the base precursor is not sufficient to trigger 
LP2-base association. Untagged LP2, LP2 with 
N-terminaiiy GFP-tagged Rpn6 (GFP-LP2), or iid 
(1 i^M each) were added to 1 i^M base and 1 0 |xM 
RpnIO before anaiysis by native PAGE. Asterisk, 
poorly resolved or potentially aggregated protein(s). 

(D) Addition of LP2 containing truncated forms of 
Rpn6 is not sufficient to trigger LP2-base associ- 
ation. Untagged LP2, LP2, lid, or the indicated 
forms of Rpn6 truncation mutant LP2 (1 ^iM each) 
were added to 1 i^M base and 1 0 i^M Rpnl 0 before 
analysis by native PAGE and immunoblotting. 

— RP Asterisk, cross-reactive band. 

_ Base See also Figure S4. 

> LP2/Lid 



♦ 

diate or a lid subcomplex with extra- 
proteasomal functions (Hofmann et al., 
2009; Rinaldi et al., 2004, 2008), but we 
speculate that Rpnl 1 DUB may be held 
inactive in the context of LP2 and lid. 

The repositioning suggested by our QCLMS data may be 
required for Rpnil activation during incorporation into the pro- 
teasome holoenzyme, perhaps by relief of inhibitory contacts 
with other lid subunits. It is unlikely that Rpn6 would serve this 
autoinhibitory role given that Rpn6-N is extended away from 
the Rpn8/Rpn1 1 module in the EM structure of the lid. However, 
the high flexibility of Rpn5 (Figure 3) makes it an attractive candi- 
date for this putative function. Notably, the Rpn5 paralog of 
the lid-related COP9/signalosome (CSN), CSN4, engages and 
inhibits the activity of the CSN6-CSN5 module (paralogous to 
Rpn8/Rpn11) in the fully assembled CSN (Lingaraju et al., 
2014). This inhibition is released upon substrate engagement 
to stimulate the deneddylase activity of CSN5. 

The lid is generally thought to interact stably with the base 
(and perhaps the CP), and Rpnl 1 DUB activity is regulated in 
part through ATP binding and/or hydrolysis by the base (Liu 
et al., 2006; Matyskiela et al., 201 3; Verma et al., 2002). However, 
Rpn8 and Rpnil make very little contact with base subunits 
in the proteasome holoenzyme. Our observation that Rpnl 2 
binding to the lid at a remote location can reposition the Rpn8- 
Rpnll module suggests the possibility that ATP binding or 
hydrolysis by the base might also allosterically regulate this 
module through movements of the lid rather than direct contacts 
with the ATPase heterohexamer itself. 
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Figure 7. Engagement of the Lid Helical 
Bundle by the Rpn12 C-Terminal Helix Li- 
censes the Lid for RP Assembly 

(A) A fluorescently labeled peptide corresponding 
to amino acids 254-272 from yeast Rpn12 
(N12pep) binds to LP2 similarly to full-length 
Rpn12. The average difference in fluorescence 
anisotropies between the free fluorescein-labeled 
N12pep and N12pep after addition of the indicated 
proteins are shown. Error bars, SEM. 

(B) Addition of N12pep triggers extension of the 
Rpn6 N terminus from the body of LP2. N12pep 
(10 i^M) was added to purified, untagged LP2, 
followed by analysis by negative-stain EM. The 
width of the box is ^430 A. 

(C) N12pep triggers association of LP2 and base in 
the absence of Rpnl 2. EGFP-LP2 (1 or 1 0 |iM) was 
incubated with 1 ^iM base, 10 laM RpnIO, and the 
indicated concentration of N12pep before native 
PAGE and immunoblotting. RP*, an RP-like spe- 
cies devoid of Rpnl 2 but reactive with both lid 
(not shown) and base antibodies. 

(D) N12pep, but not V5 or SxFLAG control 
peptides, trigger LP2-BaseP association. LP2 
harboring an N-terminal EGFP tag on Rpn6, base 
harboring an N-terminal MBP tag on Rpti, and 
RpnIO were incubated as in (C) with 1 00 laM of the 
indicated peptides, followed by analysis as in (C). 
An arrowhead marks RP*. 

(E) Model for Rpnl 2-dependent licensing of the lid 
for lid-base joining. LP2 assumes a closed 
conformation with the N-terminal domain of Rpn6 
folded inward, causing steric clash with the base. 
During incorporation of Rpnl 2 into LP2, engage- 
ment of the C-terminal helical bundle triggers 
extension of the Rpn6 N terminus, as well as 
additional, as-yet-unknown events (not depicted) 
that promote a conformation of the lid that is 
competent for binding the base. 



The Rpnl 2 C-Terminal Helix as an Allosteric Trigger 

Our EM analyses and reconstitution experiments demonstrate 
that the Rpn12 C-terminal helix initiates a global remodeling of 
the lid structure from an autoinhibited conformation to one that 
can join to the base. Importantly, this single helix is sufficient 
to promote both the remodeling of LP2, as well as interaction be- 
tween LP2 and base. Although the peptide was not as potent as 
full-length Rpn12 in the base-joining assay, this could simply 
reflect a lower affinity of the isolated peptide for LP2 compared 
to Rpn12. Other elements of Rpn12 may enhance binding of 
the C-terminal helix to LP2 via Rpn12 interactions with Rpn3. 
Additional contacts between Rpn12 and LP2 (and base) may 
also be necessary to promote an optimal conformation of the 
lid for binding to the base. 

We found that Rpn6-N undergoes a striking movement from 
a position pressed against other subunits in LP2 to an extended 
conformation in the lid. Computational docking of the LP2 
model onto the 26S proteasome indicates that steric clash 
would occur between the base ATPase ring and Rpn6-N. As 
neither displacement nor removal of the putative autoinhibitory 
domain alone was sufficient to trigger stable interaction be- 
tween LP2 and the base, additional changes must also be 
needed. Cther large steric clashes were not readily apparent 



in our modeling of LP2 onto the 26S proteasome, but this could 
simply reflect differences in the conformation of the assembly 
chaperone-bound base precursor used in our study and the 
chaperone-free base in the context of the 26S holoenzyme 
observed in previous EM studies. No structural information 
currently exists on the chaperone-bound base precursor. The 
combination of OCLMS and EM approaches used here could 
be valuable in exploring the structure of the base precursor 
relative to the structure of the base in the context of the mature 
proteasome; such differences might also be important for regu- 
lating lid-base joining. 

In this study, QCLMS was applied for the first time to study 
conformational changes involved in macromolecular assembly. 
Compared to a previous use of isotope-labeled cross-linkers 
for quantification (Schmidt et al., 2013), our workflow advanced 
a number of critical aspects: replicate analysis with label swap 
ensured accurate quantitation; summarizing quantitation into 
residue pairs reduced the impact of unrelated influences that 
would be felt at the peptide level (such as additional modifi- 
cations or digestion efficiency); and use of well-established 
quantitative proteomics software minimized errors in manual 
analysis (Schmidt et al., 2013) and significantly reduced analysis 
time. These advances in turn enabled the depth and magnitude 
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needed for the analysis of large protein assemblies, as presented 
here. 

Recently, a 5-subunit subcomplex of the lid (without Rpn12) 
was shown to bind to a chaperone-free base-CP complex 
in vitro (Yu et al., 2015). It is not yet clear whether this species 
points to an alternative route to RP formation, as it has not 
been reported to form naturally in vivo and it is unknown if it 
is competent for assembly into full 26S proteasomes. However, 
alternative routes to RP formation are not ruled out by our data. 
For example, the full lid bound to a subset of base subunits was 
isolated from bovine erythrocytes and shown to be capable of 
assembly into full RPs (Thompson et al., 2009). Alternatively, 
RPs or 26S proteasomes might be partially disassembled 
under some conditions (Fane and Prevelige, 2003; Nanduri 
et al., 2015). 

As engagement of the LP2 helical bundle by the Rpn12 
C-terminal helix was sufficient to promote interaction between 
LP2 and base independent of the rest of Rpn12, an allosteric 
signal is likely transmitted through the helical bundle. Rpn12 
binding may reposition this bundle in a manner conducive 
for base binding. Further structural studies will be necessary 
to identify the specific subunit reconfigurations that drive 
lid-base joining. Another important challenge will be to deter- 
mine the allosteric path of signal transduction from the Rpn12 
C-terminal helix to the rest of the lid during conformational 
switching. 

The Rpn12-lnduced Conformational Rearrangement Is 
Reminiscent of Viral Maturation 

Several parallels can be drawn between proteasome biogenesis 
and the assembly of viruses. As with the proteasomal CP and RP 
base, assembly of viral capsids frequently depends on exoge- 
nous chaperone proteins to form functional structures with the 
proper subunit arrangements (Fane and Prevelige, 2003). 
Further, in analogy to lid assembly and lid-base joining, incorpo- 
ration of major viral subcomplexes also follows hierarchical 
assembly orders. For example, the head, base, and tail fibers 
of the T4 bacteriophage each assemble independently prior to 
joining together (Aksyuk and Rossmann, 2011). Finally, the cap- 
sids of many class I viruses undergo dramatic conformational 
rearrangements upon completion of distinct assembly events 
(Mateu, 2013). These rearrangements are coupled to the assem- 
bly state either via proteolytic cleavage reactions or by allosteric 
contacts analogous to those observed for LP2 and Rpn12. They 
in turn license the capsid for subsequent assembly steps, such 
as viral DMA packaging or attachment to the tail. Thus, while 
the biochemical mechanisms differ, these functional themes 
are shared between highly disparate macromolecular com- 
plexes and likely contribute to the faithful biogenesis of other 
multisubunit assemblies. 

Our multipronged approach to identifying structural differ- 
ences between assembly intermediates should prove valuable 
in future efforts to understand conformational switches and their 
role in proteasome assembly. The ability to reconstitute the 
RP in vitro from fully recombinant subunits will greatly facilitate 
biochemical and biophysical analyses of these processes. This 
approach should be readily extendable to studies of the assem- 
bly and action of other large multisubunit complexes. 



EXPERIMENTAL PROCEDURES 
Plasmids 

Plasmids used in this study are listed in Table S2. Generation of multigene 
operons for bacterial expression was done as described (Tomko and Hoch- 
strasser, 2014). All mutations were first introduced into host plasmids contain- 
ing individual subunit expression cassettes, confirmed by DNA sequencing, 
and then transferred by subcloning into the appropriate operons. 

Recombinant Protein Expression and Purification 

All purified proteins were derived from the S. cerevisiae coding sequences 
and were purified either from S. cerevisiae or from E. coii as indicated in the 
text. All proteins and complexes were affinity purified using conventional tech- 
niques, followed by gel filtration. See also Supplemental Experimental 
Procedures. 

Fluorescence Anisotropy Measurements 

A peptide with residues 254-272 of Rpn12 followed by a 5,6-carboxyfluores- 
cein-labeled lysine (NH2-DQKTNIIEKAMDYAISIENIK*-C02H, herein referred 
to as N12pep) was synthesized by Elim Biopharmaceuticals. Fluorescent 
peptide (1 laM) was incubated in Lid Buffer alone, with 1 ^iM LP2, 5 i^M 
Rpn12, or both at 25°C for 20 min. Fluorescence polarization was then 
measured on a two-channel fluorometer (PTI) with linear polarizers with exci- 
tation at 485 nm. The change in anisotropy compared to peptide alone was 
calculated using the average of the anisotropy values over the emission range 
513-528 nm. 

Assembly Assays 

LP2 and its derivatives, lid, recombinant base precursor, and recombinant 
Rpn12 were diluted together to 1 laM each along with 10 i^M RpnIO in 26S 
buffer (50 mMTriseHCI [pH 7.5], 5 mM MgCl 2 , 10% glycerol, 1 mM ATP) unless 
otherwise noted. After incubation at 30°C for 20 min, aliquots of each reaction 
were separated by native polyacrylamide gel electrophoresis (native PAGE) 
at 100 V at 4°C as described previously (Tomko and Hochstrasser, 2014). 
Native PAGE-separated proteins were transferred to PVDF membranes and 
subjected to immunoblot analysis. 

Negative-Stain Eiectron Microscopy 

LP2 and Lid complexes were diluted to ^50 nM in Buffer A (50 mM TrisoHCI 
[pH 7.5], 150 mM NaCI, 5 mM MgCl 2 , 10% glycerol) immediately prior to 
applying the sample to glow-discharged 400 mesh continuous carbon grids. 
After staining with 2% (w/v) uranyl acetate, the residual stain was blotted off, 
and the samples were air-dried in a fume hood. Data were acquired using a 
Tecnai F20 Twin transmission electron microscope at 120 keV at a nominal 
magnification of 80,000 x (1 .45 A at the specimen level) using low-dose expo- 
sures (-^20 e“A“^) with a randomly set defocus ranging from -0.5 to -1 .3 |im. 
A total of 150-600 images of each LP2/lid complex sample were automatically 
recorded on a Gatan 4k x 4k CCD camera using the MSI-Raster application 
within the automated macromolecular microscopy software LEGINON 
(Subway et al., 2005). Specifically, 600, 510, 150, 150, and 300 micrographs 
were acquired of LP2, Lid, LP2 supplemented with purified Rpn12, LP2 with 
EGFP-Rpn6 complexes, and LP2 with peptide, respectively. See also Supple- 
mental Experimental Procedures. 

Cross-linking and Sample Preparation for Quantitative 
Cross-linking/Mass Spectrometry 

LP2 or lid (100 |ag) purified from yeast in Buffer HA (50 mM HEPESeNaOH, 
150 mM NaCI, 5 mM MgCl 2 , 10% glycerol) were each cross-linked with 
bis[sulfosuccinimidyl] suberate-dO (BS^-dO) and its deuterated form 
bis[sulfosuccinimidyl] 2,2,7,7-suberate-d4 (BS^-d4) (Thermo Fisher) at 1:200 
protein to cross-linker. The reaction was incubated at 30°C for 1 hr and 
quenched with 2 M glycine (pH 7.0) for 30 min at room temperature. Mono- 
meric cross-linked lid and LP2 were isolated via SDS-PAGE and in-gel 
digested using trypsin (Chen et al., 2010). After digestion, cross-linked lid 
and LP2 samples were mixed in 1:1 molar ratio, yielding two samples for 
quantitative analysis: lid+BS^-dO/LP2+BS^-d4 and lid+BS^-d4/LP2+BS^-dO. 
Peptides were fractionated by strong cation exchange chromatography and 
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desalted using C18-StageTips for LC-MS/MS (Chen et al., 2010). See also 
Supplemental Experimental Procedures. 

ACCESSION NUMBERS 

The accession number for the LP2 EM density map reported in this paper is 
EMD: 3136. 
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Supplemental Information includes Supplemental Experimental Procedures, 
four figures, two tables, and one data file and can be found with this article 
online at http://dx.doi.Org/10.1016/j.cell.2015.09.022. 
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SUMMARY 

RNA-directed DNA methylation in Arabidopsis 
thaliana is driven by the plant-specific RNA Polymer- 
ase IV (Pol IV). It has been assumed that a Pol IV 
transcript can give rise to multiple 24-nt small inter- 
fering RNAs (siRNAs) that target DNA methylation. 
Here, we demonstrate that Pol IV-dependent RNAs 
(P4RNAs) from wild-type Arabidopsis are surpris- 
ingly short in length (30 to 40 nt) and mirror 24-nt 
siRNAs in distribution, abundance, strand bias, and 
5'-adenine preference. P4RNAs exhibit transcription 
start sites similar to Pol II products and are featured 
with 5'-monophosphates and 3'-misincorporated 
nucleotides. The 3'-misincorporation preferentially 
occurs at methylated cytosines on the template 
DNA strand, suggesting a co-transcriptional feed- 
back to siRNA biogenesis by DNA methylation to 
reinforce silencing locally. These results highlight 
an unusual mechanism of Pol IV transcription and 
suggest a “one precursor, one siRNA” model for 
the biogenesis of 24-nt siRNAs in Arabidopsis. 



INTRODUCTION 

In Arabidopsis, 24-nt siRNAs are the triggers for RNA-directed 
DNA methylation (RdDM), which plays central roles in repres- 
sing transposable elements (TEs) and maintaining genome 
integrity (Law and Jacobsen, 2010; Matzke and Mosher, 
2014). The current model for 24-nt siRNA biogenesis is 
composed of several sequential steps. First, Pol IV recognizes 
heterochromatic regions, in part via SAWADEE HOMEODO- 
MAIN HOMOLOG 1 (SHH1) (Law et al., 2013), and transcribes 
precursor RNAs. These precursor RNAs are then thought to 
be processed by RNA-dependent RNA polymerase 2 (RDR2) 

CrossMark 



to form double-stranded RNAs (dsRNAs). The dsRNAs are pri- 
marily cleaved by dicer-like 3 (DCL3) to produce 24-nt siRNAs 
(Matzke and Mosher, 2014). Although Pol IV is clearly required 
for the biogenesis of 24-nt siRNAs (Herr et al., 2005; Kanno 
et al., 2005; Onodera et al., 2005; Pontier et al., 2005) and 
was shown to be transcriptionally active in vitro (Haag et al., 
2012), the nature and characteristics of Pol IV-dependent tran- 
scripts remain poorly understood. The low transcription level at 
silent Pol IV loci and the efficient downstream processing by 
dicers has made Pol IV transcripts difficult to study. Recently, 
regions containing Pol IV-dependent transcripts were 
described in a dci2l3l4 triple mutant that compromises down- 
stream processing of siRNAs (Li et al., 2015). In that study, with 
the conventional assumption that Pol IV transcribes long pre- 
cursors, RNA was fragmented before cloning, and sequenced 
reads were assembled into contiguous regions that corre- 
sponded to well-known siRNA-producing regions (Li et al., 
2015). In the present study, we sought to identify the very low 
abundance Pol IV-dependent transcripts in a wild-type back- 
ground, and to precisely characterize their start and end posi- 
tions. We utilized a special method for RNA sequencing named 
PATH (parallel analysis of fail and f?ead) that utilizes RNA 
adapters to capture both ends of any RNA greater than 27-nt 
with a 5' monophosphate and a 3' hydroxyl. Using PATH, we 
efficiently cloned Pol IV-dependent RNAs (P4RNAs) and found 
that they were only on the order of 30 to 40 nt in length. These 
P4RNAS mirror 24-nt Pol IV-dependent siRNAs in distribution, 
abundance, strand bias, and 5'-adenine preference, suggest- 
ing that they are direct precursors that often determine both 
the start position and strandedness of siRNAs. We also 
observed extensive incorporation of 3'-non-templated nucleo- 
tides preferentially at methylated cytosines on the template 
DNA strand, suggesting that Pol IV misincorporates and 
terminates at DNA methylated sites. Our results support a 
“one precursor, one siRNA” model for the biogenesis of Pol 
IV-dependent 24-nt siRNAs in Arabidopsis that creates a 
positive feedback loop between siRNA biogenesis and DNA 
methylation. 
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Figure 1 . Identification of Short RNAs Associated with siRNA Loci 

(A) Procedure for the construction and analysis of PATH libraries. An example siRNA locus is shown at the bottom with IGV screenshots of P4RNAs and siRNAs 
matching to that region. 

(B) Abundances are highly correlated for PATH reads (27+ nt) and sRNA reads (18 to 26 nt) in wild-type Col from previously defined Pol IV siRNA loci (Law et al., 
2013). 

(C) Size distribution plots of all PATH reads and sRNA reads in Col from Pol IV siRNA loci. A distinct peak at 30 to 40 nt can be seen in the PATH library; we named 
these “P4RNAs.” 

(D) P4RNAS and siRNAs share the same strand bias at Pol IV siRNA loci. Only Pol IV siRNA loci matched by more than 1 00 P4RNAs in Col were selected to obtain a 
robust calculation of strandedness. The plus-strand ratio was calculated as the abundance of reads matching to the plus strand divided by the total number of 
reads at that locus. 

See also Figures SI , S2, and S3 and Tables SI , S2. 



RESULTS 

Identification of Short RNAs Derived from Pol IV- 
Dependent 24-nt siRNA Loci 

We utilized an RNA cloning scheme called PATH that uses RNA 
ligation and gel-based size selection to capture RNAs larger than 
27 nt that contain a 5' monophosphate and a 3' hydroxyl (Fig- 
ure 1A, see Experimental Procedures). From a wild-type 
Columbia (Col) library, around 162 million PATH reads were ob- 
tained, of which ~50 million could be mapped uniquely to the 
Arabidopsis genome (Table SI), and PATH reads with a minimal 
length of 27 nt were used for further analysis. Although the ma- 
jority of PATH reads matched to structural RNAs (t/r/sn/snoRNA) 
(Table SI), a large number of reads mapped to previously 
defined Pol IV siRNA loci (Law et al., 2013) (Figure 1 A and Table 
S2). In addition to their co-localization, the abundances of 
siRNAs and PATH reads were also highly positively correlated 
(Figure IB). At Pol IV siRNA loci, in addition to the 24-nt siRNA 
peak that is still present in PATH libraries (Figure 1C), we 
observed a secondary peak of PATH reads (P4RNAs) that pri- 



marily ranged from 30 to 40 nt (Figure 1C). The size distribution 
of P4RNAS was different from other PATH reads in the libraries 
such as reads matching tRNAs and snoRNAs (Figure SI A). The 
short length of P4RNAs fits well with the specific preference of 
DCL3 for short (30 to 50 bp) dsRNA substrates (Nagano et al., 
2014), consistent with the hypothesis that these short P4RNAs 
serve as the precursors for siRNA biogenesis. 

Biogenesis and Processing of Pol IV RNAs 

To investigate the biogenesis of P4RNAs, we constructed small 
RNA and PATH libraries from null mutants of NRPD1 (encoding 
the largest subunit of Pol IV), NRPD/E2 (encoding the shared 
second-largest subunit of both Pol IV and Pol V), RDR2, as 
well as from the double mutants of DCL3 NRPD1 and DCL3 
RDR2. We found that both siRNAs and P4RNAs were eliminated 
in all these mutant backgrounds, indicating a complete depen- 
dency of P4RNA biogenesis on both Pol IV and RDR2 (Figure 2A). 
Although the loss of P4RNAs in rdr2 mutants is somewhat 
unexpected given the assumed downstream function of RDR2 
(Haag et al., 2012), this result is consistent with a previous report 
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Figure 2. Biogenesis and Processing of P4RNAs 

(A) Both Pol IV siRNAs and P4RNAs are eliminated in nrpd1 , nrp(dle)2, rdr2, 
dcl3/nrpd1, and dc\3!rdr2. Small RNA abundances were normalized to the 
sum of all TAIRIO-annotated mlRNAs in each sRNA library, and then 
compared to Col; P4RNA abundances were normalized to the total number of 
reads in each PATH library. 

(B) Pol IV siRNAs are reduced in dc!2/3/4, whereas P4RNAs accumulate 
substantially. 

(C) Abundances of P4RNA and siRNA at each locus in Col and dc\2l3l4. 

(D) At long TEs, P4RNAs resemble the distribution of siRNAs - enriched at 
promoters and termini of TEs, and also spread into TE bodies. 

See also Tables S1 and S2. 



(Li et al., 201 5), and suggests that in vivo, Pol IV and RDR2 activ- 
ities are tightly coupled. These results are also consistent with 
the earlier observation that RDR2 is in tight association with 
Pol IV subunits in vivo (Haag et al., 2012; Law et al., 2011). 

To test if P4RNA biogenesis is dependent on downstream pro- 
cessing events by dicer proteins, we carried out additional siRNA 
and PATH library analysis from plants triply mutant for DCL2, 3, 
and 4. Consistent with a recent report detecting Pol IV dependent 
transcripts in a dc\2l3l4 mutant background (Li et al., 2015), we 
observed a dramatically increased accumulation of P4RNAs in 
the triple mutant dc\2l3l4 (Figures 2B and 2C). In contrast, 
siRNAs levels were dramatically decreased in dc\2l3l4 at almost 
all loci (Figures 2B and 2C) and their sizes were shifted from pre- 
dominantly 24 nt to predominantly 21 nt (Figure SI B). In addition, 
we observed that the abundances of the remaining siRNAs in 
dcl2/3/4 were tightly correlated with the abundances of siRNAs 
in wild-type (Figure 2C). We also found that the abundances of 
P4RNAS in dc\2l3l4 were strongly positively correlated with the 
abundances of siRNAs in Col (Figure SIC). As one example, 
P4RNAS and siRNAs showed a very similar enrichment at the 
boundaries of long transposable elements (Figure 2D). The accu- 
mulation of P4RNAS but reduction of siRNAs in dc\2l3f4 mutants 
are inconsistent with the interpretation that P4RNAs are merely 
longer, misprocessed siRNAs, and suggests that P4RNAs are 
indeed the precursors of Pol IV-dependent siRNAs. 

We also analyzed siRNA and PATH libraries from other DCL 
mutant combinations (dcl2/4, dcl3, dc\2l3, and dcl3/4) and 
compared these with dc\2l3l4. As previously reported (Hender- 
son et al., 2006), all dcl3 mutant combinations showed a shift 
in siRNA size from 24 nt to 21 and/or 22 nt (Figure 3A, top). 
PATH sequencing indicated that the over-accumulation of 
P4RNAS was negligible in c/c/2/4; higher in c/c/3, c/c/2/3, and 
c/c/3/4; and highest in dcl2/3/4, indicating that the P4RNA 
processing by DCL2 and DCL4 is less efficient than by DCL3 
(Figure 3A, bottom). As a secondary confirmation of the accumu- 
lation of P4RNAS in c/c/3 mutants, we examined small RNA 
northern blots for the presence of 30- to 40-nt signals. Using 
LNA probes to six different siRNA producing loci we indeed 
observed an RNA smear with larger sizes, along with an accumu- 
lation pattern (highest in dc\2l3l4) that matched the trend of the 
P4RNA level in our PATH libraries (Figures 3B and 3C) (Hender- 
son et al., 2006). In addition, we used a reverse transcription 
qPCR (real-time RT-PCR) approach to verify the very short 
nature of the P4RNAs. Because the peak of P4RNA abundance 
is 30-40 nt, whereas longer P4RNAs are much less abundant 
(Figure 1C), we reasoned that primers spaced about 40 nt from 
each other should amplify P4RNAs much more efficiently than 
those spaced slightly further apart. After normalizing for effi- 
ciency of DNA amplification, we observed a 1 00-fold higher level 
of PCR product when comparing a 36 bp versus a 58 bp ampli- 
con at one locus, and 50-fold higher level of PCR product when 
comparing a 39 bp versus an 81 bp amplicon at a second locus 
(Figure S2A). 

To directly test whether the longer P4RNAs might simply be 
misprocessed siRNAs that are loaded into AG04, we performed 
AG04 RNA-IP (RIP) in the dcl2/3/4 genetic background using 
antibodies against endogenous AG04 and then characterized 
AG04-associated RNAs with high-throughput sequencing. The 
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AG04 RIP result showed that even though there is a massive 
accumulation of longer P4RNAs and a reduction of 24nt 
siRNAs, AG04 still selectively binds to the remaining 22-24 nt 
siRNAs but not the longer P4RNAs (Figure S2C), strongly sup- 
porting our model that P4RNAs are the precursors of the 24-nt 
siRNAs. 

Even though double-stranded RNAs are thought to be inter- 
mediates of siRNA biogenesis, it is known that at many loci in 
the genome, siRNAs are predominantly found on one strand of 
the genome but not the other (Lister et al., 2008; Zhong et al., 
2014). In addition, these strand-biased clusters of siRNAs corre- 
spond to strand biased DNA methylation (Lister et al., 2008; 
Zhong et al., 2014). At these strand-biased siRNA clusters, we 
also observed strongly strand-biased P4RNAs in both Col and 
dc\2l3l4 (Figures ID and SID). These results suggest that our 
PATH libraries are primarily composed of single-stranded, Pol 
IV-derived strands, but not the RDR2-derived second strand. 
In support of this hypothesis, we used RT-PCR to amplify these 
strand-biased clusters, and indeed found that Pol IV/RDR2- 
dependent transcripts could be amplified equally from both 
strands (Figure S3). This suggests that RDR2 strands are likely 
present in vivo but are not cloned by the specific PATH 
sequencing technique employed in this study. The observation 
that P4RNAS and siRNAs show the same strandedness suggests 
that the Pol IV-derived strands, rather than the RDR2-derived 
strands, are strongly favored to become the final 24-nt siRNA 
products. 

In summary, results from siRNA and PATH libraries of mutants 
deficient in siRNA biogenesis or processing, together with the 



Figure 3. P4RNAs in del Mutant Combina- 
tions 

(A) Size of Pol IV siRNAs shifted from 24 nt in Col to 
21/22 nt in c/c/3-containing mutants, accompanied 
by the accumulation of P4RNAs. P4RNA abun- 
dances are normalized to that in Col. 

(B) RNA blot analysis of siRNAs and P4RNAs in del 
mutant backgrounds. 

(C) Size distribution of P4RNAs in c/c/3-containing 
mutants. 

See also Figure S3 and Tables SI and S2. 



shared strandedness of P4RNAs and 
siRNAs, suggest that the 30 to 40 nt 
P4RNAS serve as precursors to Pol IV 
dependent siRNAs. 

Pol IV Transcription Initiates at Pol 
IMike TSSs and Favors 5^-Adenine 

Next we investigated the nature of 
the 5' ends of P4RNAs. Given that our 
cloning method only captures RNAs 
with 5-monophosphates (Figure 1A), we 
sought to measure the proportion of 
P4RNAS containing this type of 5' end. 
To address this, we used Terminator 
exonuclease to preferentially digest 
RNAs with a 5'-monophosphate. We sub- 
sequently measured the abundance of the remaining P4RNAs 
by real-time RT-PCR. Consistent with a recent study (Li et al., 
2015), Terminator treatment degraded the majority of P4RNAs 
at all loci tested (Figure S2B). Thus, while it is possible that 
subpopulations of P4RNAs have other end structures such as 
5'-triphosphates, 5'-caps, or 5'-hydroxyl groups, it appears 
that the majority of P4RNAs contain 5'-monophosphates. 

Because our method did not include a fragmentation step 
that is typical in RNA-seq library protocols (Li et al., 2015), 
it was possible to detect the 5' nucleotide of P4RNA reads. 
We observed a strong enrichment of T/C (Y) at the -1 position 
(the nucleotide immediately upstream of the first nucleotide of 
the P4RNA read) and A/G (R) at the -i-l position (beginning nucle- 
otide of the read) (Figure 4A). Further, the four possible Y/R dinu- 
cleotides at the -1/-I-1 positions were by far the most enriched 
dinucleotides at the 5' end of P4RNAs (Figure 4B). This pattern 
is very similar to that known for the transcriptional start sites 
(TSSs) of RNA Polymerase II (Pol II) in plants and other organisms 
and it is referred to as the “Y/R rule” (Cumbie et al., 2015; 
Nechaev et al., 2010; Yamamoto et al., 2007). This result sug- 
gests that Pol IV has retained this preference from its evolu- 
tionary ancestor Pol II (Ream et al., 2009), and that the 5' ends 
of P4RNAS likely represent Pol IV transcriptional start sites. 
The short TSS like sequences at the 5' ends of P4RNA, along 
with their very short nature does not support previous models 
in which Pol IV initiates transcription solely at the nucleosome- 
depleted promoter regions near the ends of transposons to pro- 
duce long transcripts (Li et al., 201 5). Instead, our results suggest 
that Pol IV can initiate transcription at many positions that 
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Figure 4. P4RNAs Feature Pol ll-like TSSs and Favor 5' -Adenine 

(A) TSSs of Pol IV exhibit preference for C/T at the -1 position and A/G at 
the +1 position, resembling the “Y/R rule” of Pol II TSS, calculated using 
P4RNAS from dc\2l3l4. 

(B) Dinucleotide enrichment at -1/+1 of P4RNA, calculated using P4RNAs 
from dc\2l3l4. 

(C) Both sIRNA and P4RNAs from Col have a strong preference for 5' adenine. 
See also Figures S4 and S5. 



resemble the Y/R features of Pol II TSSs, transcribing many short 
P4RNAS along the length of transposons. 

Because Pol IV transcripts feature Pol ll-like TSSs, we 
also performed genome-wide profiling of Pol II occupancy in 
Arabidopsis via ChIP-seq in wild-type and different mutant 
backgrounds (nrpd1 and dc\2l3l4), and compared this with Pol 



IV ChIP-seq (Law et al., 2013). Our results showed that Pol II 
does not appear to access Pol IV loci even in the absence of 
Pol IV (nrpd1) (Figure S4A). Furthermore, whole-genome bisulfite 
sequencing and small RNA sequencing of floral tissues from the 
weak Pol II mutant (nrpb2-3) did not reveal obvious changes in 
either DNA methylation or siRNA biogenesis (Figures S4B and 
S4C), which is consistent with our previous analysis of nrpb2-3 
using leaf tissue (Stroud et al., 2013). Therefore despite the pro- 
posed crosstalk between Pol II and the RdDM pathway (Zheng 
et al., 2009), our data suggest that Pol II and Pol IV occupy 
distinct territories on the genome. 

Arabidopsis 24-nt siRNAs are primarily loaded into AG04 and 
are strongly biased toward having a 5'-adenine, which was pre- 
viously shown to involve an AG04 loading preference (Havecker 
et al., 201 0; Mi et al., 2008). Interestingly, we found that P4RNAs, 
like siRNAs, also show a strong enrichment for 5'-adenine (Fig- 
ures 4C, S5A, and S5B). Because the size of P4RNAs is approx- 
imately 30 to 40 nt, on average only one 24-nt siRNA duplex 
could be processed from each of these P4RNA precursors. 
This fact, coupled with the shared 5' adenine preference and 
the shared strand preference, suggests that 24-nt siRNAs are 
preferentially cleaved from the 5' portion of P4RNAs. Consistent 
with this hypothesis, DCL3 was shown to prefer short double- 
stranded RNAs (30 to 50 bp) that contain a 5' adenine (Nagano 
et al., 2014). Therefore, our results favor a scenario in which 
the 5'-adenine preference of P4RNAs likely contributes to the 
5'-adenine preference of Pol IV siRNAs. In addition, our results 
provide a plausible explanation as to why AG04 evolved to 
bind siRNAs with 5' adenine. Taken together, the short size 
and 5'-A feature of Pol IV transcripts may help to channel their 
processing to DCL3 rather than dicer proteins in other silencing 
pathways, and thus lead to production of predominantly 24-nt 
siRNAs at Pol IV transcribed loci. 

Pol IV Transcription Preferentially Terminates at 
Methylated Cytosines with Misincorporated 
Nucleotides 

We analyzed the sequence composition of P4RNA reads with 
perfect match to genome and found enrichment for A, C, and 
U at last three positions of the 3' end (Figure S5C). It is not known 
whether Pol IV tends to cease transcription at this sequence, or 
whether it might transcribe a longer RNA that is then processed 
by an unknown endonuclease. We found a similar compositional 
bias at the 3' end of Pol IV-dependent siRNAs (Figure S5D), 
although the magnitude of the biases were lower for siRNAs, 
suggesting that P4RNAs are processed at some level at their 
3' ends to produce siRNAs. Despite the ACU enrichment at the 
3' end of siRNAs, they still show enrichment for 5' adenine, which 
is likely explained by the AG04 preference for loading siRNAs 
with a 5' adenine. These results provides additional evidence 
that the P4RNAs described here are indeed the precursors of 
siRNAs, and it is again consistent with (1) the shared stranded- 
ness of P4RNAS and siRNAs, and (2) the hypothesis that the 
Pol IV strand, rather than the RDR2 strand, is favored as the final 
SiRNA. 

We also performed an analysis in which we allowed multiple 
mismatches during genome mapping of P4RNAs. Interestingly, 
we found that more than half of the P4RNAs contained one or 
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Figure 5. Pol IV Transcription Preferentially Terminates at Methylated Cytosines with Misincorporated Nucleotides 

(A) Example of 3'-non-templated nucleotides on P4RNAs. The black bar represents 50 bp in length. 

(B) Length of the 3'-end non-templated nucleotides, defined by the first mismatched nucleotide to the last nucleotide. If a P4RNA has no mismatch, the length 
is zero. 

(C) Di-nucleotide enrichment at the first mismatched position at the 3' end. 

(D) Frequency of CG dinucleotide on reference sequence over the P4RNAs with misincorporation, “-1 ” marks the last perfectly matched position, and “+1 ” marks 
the first mismatched position. The count of each CG is designated to the position of the G, therefore the peak at “+1 ” represents a peak of CG at “-1/+1 .” 
See also Figure S6. 



two non-templated nucleotides at their 3'-ends (Figures 5A and 
5B). In contrast, in the same PATH libraries, reads derived from 
Pol ll-transcribed coding regions, or from microRNA processing 
intermediates, had very few mismatches, and these mismatches 
were not localized to the 3' ends (Figures S6A and S6B). To 
further rule out the possibility that P4RNA 3' non-templated 
nucleotides are due to lower quality of sequencing toward the 
end of the read or result from incomplete trimming of adaptor 
sequence, we analyzed the second read (read2) from the 
paired-end sequencing. On read2, the beginning nucleotide 
corresponds to the 3' end nucleotide of RNA, where base quality 
is high and there is no trimming step involved, and we observed 
the same high-level of non-templated nucleotides at the 3'-end 
of P4RNA (Figure S6C). 

The 3' end base composition of P4RNAs containing non-tem- 
plated nucleotides was quite different from those with a perfect 
genome match (Figures S5C and S5E), suggesting that P4RNAs 
with non-templated nucleotides terminate by a different mecha- 
nism than those without. On the other hand, the 5' end base 
composition of reads with non-templated nucleotides was very 
similar to those without, suggesting that both classes may share 
a similar transcription initiation mechanism by Pol IV (Figures 
S5F and S5G). All four nucleotides were found among the non- 



templated nucleotides, although there was some preference 
for guanines (Figure S7A). In addition, we observed different 
preferences for non-templated nucleotides depending on the 
sequence that should have been present, suggesting that the 
preference for a particular non-templated nucleotide is deter- 
mined by the sequence of the DNA template for Pol IV (Figures 
S7B and S7C). Because all four nucleotides were present, and 
because different incorrect nucleotides were present depending 
on the template DNA sequence, it seems most likely that these 
nucleotides arise from misincorporation during Pol IV transcrip- 
tion rather than from the activity of a terminal transferase. 

Because misincorporation of nucleotides occurred most 
frequently at positions corresponding to guanines (which would 
be cytosines on the DNA template) (Figure 5A), we hypothesized 
that misincorporation might be caused by in part by cytosine 
DNA methylation. The most highly methylated sequences in 
the Arabidopsis genome are CG dinucleotides (Cokus et al., 
2008; Lister et al., 2008), and Pol IV siRNA loci targeted by 
RdDM are usually heavily methylated at most CG sites (Stroud 
et al., 2013) (Figure S4B). Agreeing with the DNA methylation 
hypothesis, we found that CG dinucleotides (G being the last 
nucleotide of the RNA) exhibited by far the highest enrichment 
of misincorporation among the 16 possible dinucleotide 
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Figure 6. Pol IV Transcription Preferentially Terminates at Methylated Cytosines with Misincorporated Nucleotides 

(A) CG hypomethylated DMRs in ddm1 compared to a wild-type control. 

(B) 3'-misincorporated nucleotides of P4RNA at ddm1 CG DMRs in ddm1 dcl3 compared to dcl3. 

See also Figure S7. 



sequences (Figure 5C). In addition, CG dinucleotides were 
strongly enriched at the 3'-end of P4RNAs that exhibited misin- 
corporation (Figure 5D). The second group of most commonly 
methylated sequences in the genome and at Pol IV siRNA loci 
are CHG sites (Figure S4B) (Cokus et al., 2008; Lister et al., 
2008). A trinucleotide analysis showed that all of the trinucleo- 
tides showing the strongest tendency for misincorporation 
were those that contained CG sites (Figure S7D). In addition, 
CHG sites showed a strong tendency for misincorporation, 
which was higher than AHG, THG, or GHG sites (Figure S7D). 

To directly test whether the loss of methylation can alter the 
pattern of 3' end misincorporation, we analyzed the ddm1 
(decrease in DNA methylation 1) mutant that exhibits a severe 
loss of methylation in heterochromatin (Matzke and Mosher, 
2014). Despite the significant loss of methylation, many silent 
loci are still producing 24-nt siRNAs in ddm1 (Colome-Tatche 
et al., 2012), suggesting that Pol IV is still largely functional 
in ddm1 . Because P4RNAs are elevated in dcl3 mutants, we 
utilized a small RNA dataset from ddm1 dcl3 double mutant 
(RNAs smaller than 200 nt) (McCue et al., 2015) and compared 
these with a similar dataset from the dcl3 single mutant. We first 
focused our analysis on a set of strong ddm1 hypomethylated 
CG DMRs (c/ifferentially methylated regions) (Figure 6A, Experi- 
mental Procedures) from whole-genome bisulfite sequencing 
data of ddm1 (Creasey et al., 2014). We found that the enrich- 
ment of P4RNA 3'-misincorporation at CG dinucleotides was 
eliminated in ddm1 dcl3 compared to the dcl3 single mutant 
(Figure 6B). We also examined trinucleotide enrichments in 
ddm1 dcl3 at a set of CHG DMRs (Figure S7E), and observed 
a significant reduction at CHG but not at AHG, THG, or GHG (Fig- 
ure S7F). Moreover, P4RNAs in ddm1/dcl3 were slightly longer 
than those in dcl3 single mutant (Figure S7G). Take together, 
these results suggest that DNA methylation itself is contributing 
to the pattern of 3'-misincorporation. 

If P4RNAS are the precursors of siRNAs and DCL3 can cleave 
to some extent from the 3'-end of P4RNA, siRNAs should also 
contain some level of misincorporated bases at their 3' ends. 
By allowing for mismatches during genome mapping, we indeed 
found that siRNAs contain non-templated nucleotides that were 
enriched at the 3' end (Figure S6D). However, the proportion of 
siRNAs with 3'-mismatches (~1 %) was far lower than that of 



P4RNAS (~50%, Figure 5B), suggesting that far fewer siRNAs 
are processed from the 3' ends of P4RNAs than from the 5' 
ends, possibly due to the preference of 5'-adenine by DCL3 
(Nagano et al., 2014). We also observed enrichment of CG sites 
at the 3' end of siRNAs that showed misincorporation (Fig- 
ure S6E), again consistent with some level of processing of the 
3' end of P4RNAs into siRNAs. 

Since DCL3 appears to process from both ends of the double- 
stranded Pol IV/RDR2-derived RNA, and since P4RNAs are 
enriched for adenines at their 5' ends and misincorporated 
nucleotides at their 3' ends (Figure 7A), two predictions are 
that siRNAs with 5' adenines should have lower than average 
3' misincorporation, and siRNAs with 3' misincorporation should 
have lower than average 5' adenine content. Indeed, we found 
that siRNAs with 5' adenines had 50% lower misincorporation 
than siRNAs with other 5' nucleotides (Figure 7B), and siRNAs 
with misincorporated nucleotides showed a lower 5' adenine 
content than those with perfect matches to the genome (Fig- 
ure 7C). These results further support that P4RNAs containing 
misincorporated 3' nucleotides are processed into siRNAs. In 
summary, our results support that three different mechanisms 
can contribute the formation of the 3' end of P4RNAs - termina- 
tion or 3' end processing at sequences enriched for ACU at 
the last three positions of the P4RNA, termination associated 
with misincorporation at methylated cytosines, and termination 
associated with misincorporation at other nucleotides. 

DISCUSSION 

The results of this study support the general scheme that 
P4RNAS are first transcribed by Pol IV, made double stranded 
by RDR2, and diced by DCL3 and other dicers to make the 
siRNAs which are loaded into AG04. DCL3 cleavage produces 
an siRNA duplex with symmetric structure (Matzke and Mosher, 
201 4), but only one strand (the guide strand) is retained in AG04, 
while the other strand (the passenger strand) is cleaved by AG04 
and degraded (Ye et al., 2012). Little is known about guide- 
strand selection of 24-nt siRNAs, and it remains unclear why, 
at many loci, siRNAs predominately match to one strand, and 
direct strand-specific methylation (Zhong et al., 2014). Our anal- 
ysis revealed that the strand biases for P4RNAs and siRNAs 
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Figure 7. “One Precursor, One siRNA” Model for the Biogenesis 
of Pol IV Dependent 24-nt siRNAs 

(A) Pol IV transcription is initiated at Pol ll-like TSSs. A short RNA of -^30 to 40 nt 
(with 5'-adenine preference) is produced by Pol IV at heterochromatic regions; 
misincorporation then occurs at the cytosine position (red bar) and terminates 
Pol IV transcription. This process typically yields a P4RNA with 5' adenine and 3' 
misincorporation. P4RNA then goes through processing that involves RDR2 
synthesizing the complementary strand (gray), DCL3 cutting from the 5' end 
(major) or 3' end (minor) of the P4RNA, and loading of the P4RNA-derived siRNA 
strand into AG04 as the guide-strand. Eventually one P4RNA precursor gives 
rise to one siRNA that is derived from either its 5' or 3' end. (“M” underneath 
the DNA template indicates DNA methylation at the heterochromatic region.) 

(B) When DCL3 cuts from the 5' end of P4RNAs, the resulting siRNAs are more 
likely to carry a 5' adenine and be perfectly matched. Indeed, a reduction of 
3' mismatches is observed for siRNAs with a 5' adenine. 

(C) When DCL3 cuts from the 3' end of P4RNAs, the resulting siRNAs are 
more likely to carry the 3' misincorporation and less likely the 5' adenine. This 
is consistent with the observation that siRNAs with mismatch have lower 
percentage of 5' adenine. 

See also Figure S5. 



are highly positively correlated (Figures 1 D and S1 D), suggesting 
that the siRNAs derived from the Pol IV strand rather than the 
RDR2-synthesized complementary strand, are favored as the 
guide-strand siRNA. This bias appears to be partially accom- 
plished by a strong bias for adenine to be present as the first 
nucleotide of P4RNAs, by the preference of DCL3 for 5' adenines 
for siRNA processing, and by the preference of AG04 for loading 
siRNAs with a 5' adenine (Figure 7A). P4RNAs also appear to be 
processed to some extent at their 3' ends to yield siRNAs that 
have similar 3' end signatures as are present in P4RNAs, 
including either an enrichment for ACU sequences, or misincor- 
porated nucleotides. In addition, Pol IV transcripts are so short 
that on average only one siRNA will arise from each P4RNA. 
Finally, the short nature of Pol IV transcripts, coupled with the 
preference of DCL3 for short dsRNAs, may serve to channel 
Pol IV/RDR2 products into DCL3 rather than other dicers, such 
as DCL4 that prefers long dsRNAs (Nagano et al., 2014). These 
results imply a “one precursor, one siRNA” model for processing 
of siRNAs from Pol IV to RDR2 to DCL3 to AG04 (Figure 7A). 
Based on the patterns of TSSs, strand-bias and 3'-misincorpora- 
tion we conclude that the majority of PATH reads at siRNA loci 
are P4RNA, but it is still possible that some of these PATH 
reads are RDR2-transcribed as they are genetically dependent 
on both RDR2 and Pol IV. 

Because Pol IV transcripts are so short, the sites of Pol IV 
transcriptional initiation and termination are largely determining 
the positions of siRNAs in the genome. Production of short 
transcripts by Pol IV could provide for siRNA biogenesis while 
avoiding the risk of transcribing full-length transposable ele- 
ments. In addition, the short length of Pol IV transcripts may 
help to prevent spreading of RdDM to flanking regions, and allow 
specific silencing of transposons that are close to genes. 

Our proposed model for biogenesis of 24-nt siRNAs in 
Arabidopsis share characteristics with that of 21U-RNAs (or 
piRNAs) in C. elegans. 21U-RNAs are 21 -nt in length and begin 
with a 5' uridine, and they are autonomously expressed from 
thousands of loci dispersed in two broad regions of chromo- 
some IV (Batista et al., 2008; Ruby et al., 2006). Like P4RNAs, 
precursors of 21U-RNAs, which are ~26 nucleotide Pol II tran- 
scripts, are terminated by an unknown mechanism to produce 
unusually short transcripts (Gu et al., 2012). However, unlike 
P4RNAS, 21U-RNA precursors are processed at their 5' ends 
to remove two nucleotides, and transcription units usually 
contain a conserved motif 42-nt upstream of the mature piRNA 
(Batista et al., 2008; Gu et al., 201 2; Ruby et al., 2006). In contrast 
we did not find evidence for sequence conservation upstream of 
P4RNA start sites (Figure S4D), and instead Pol IV appears to be 
recruited by epigenetic signals such as the binding of H3K9 
methylation through the Pol IV interacting protein SHH1 (Law 
et al., 2013). Furthermore, while there is clear evidence for 
transposon-derived secondary siRNA production through 
RNA-dependent RNA Polymerase (RdRP) activity in C. elegans 
(22G-RNA) (Lee et al., 2012), as well as Zucchini-dependent, 
secondary phased piRNAs in mammals (Han et al., 2015; 
Mohn et al., 201 5), there is little evidence to support the possibil- 
ity of secondary siRNA production in the RdDM pathway. 

The 5' ends of P4RNAs are very similar in sequence composi- 
tion to that of Pol II transcripts, showing strong enrichment for 
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Y/R dinucleotides at the -1/+1 positions. However, as opposed 
to Pol II transcripts that begin with trimethylguanosine caps, the 
majority of P4RNAs have 5' monophosphates. Since RNA poly- 
merases normally start transcribing with a triphosphate contain- 
ing nucleotide, it is unclear how Pol IV transcripts acquire a 
5' monophosphate. It is possible that Pol IV is able to initiate 
transcription with a nucleoside monophosphate as previously re- 
ported for some RNA polymerase (Martin and Coleman, 1989; 
Ranjith-Kumar et al., 2002), or the P4RNAs may be initially cap- 
ped after which a de-capping enzyme converts the trimethylgua- 
nosine cap to a 5' monophosphate, or an unknown polyphos- 
phatase-like enzyme may directly convert the 5'-triphosphate 
to a 5'-monophosphate. Additionally, since 5' monophosphate 
containing RNAs are often a target of 5' to 3' exonucleases, there 
may be mechanisms to protect P4RNA 5' ends until subsequent 
processing steps are completed. It is also unclear why P4RNA 
evolved to have 5' monophosphates rather than cap structures, 
but one possibility is that this helps the cell avoid inadvertently 
mistaking a P4RNA for a Pol II transcript in order to avoid trans- 
lation of transposon RNAs. 

Our observations suggest that a component of the mechanism 
by which Pol IV transcription terminates is that DNA methylation 
on the template strand causes misincorporation of inappropriate 
bases. The mechanism by which this happens is unclear, but it 
is known that DNA methylation can cause transcriptional elonga- 
tion defects in Neurospora (Rountree and Selker, 1997). The 
preferential termination of Pol IV transcription near DNA methyl- 
ation would promote siRNA generation near sites of preexisting 
DNA methylation, thereby creating a self-reinforcing loop in 
which siRNAs direct DNA methylation targeting and DNA methyl- 
ation helps direct the location of further siRNA production. 

EXPERtMENTAL PROCEDURES 

Biological Materials 

The mutant alleles oinrpd1-4, nrpb2-3, rdr2-1, nrp(dle)2, rdr2ldc\3, and com- 
binations of dc\2l3l4 used in this study were in the background of Arabidopsis 
thaliana ecotype Columbia-0 and have been previously described (Henderson 
et al., 2006; Li et al., 2008; Rentier et al., 2005; Xie et al., 2004; Zheng et al., 
2009). The dcl3/nrpd1 double mutant was obtained by crossing dcl3-1 with 
nrpd1-4. Plants were grown in a growth chamber with 16 hr of light or green- 
house condition for five weeks. Immature inflorescence tissues including 
inflorescence meristem and early stages floral buds (up to stage 11/12) were 
collected. 

Sequencing of Small RNA, PATH mRNA, and BS-Seq Libraries 

Total RNA was first treated with RiboMinus Plant Kit for RNA-Seq (Invitrogen 
Al 0838-08) to remove rRNA, followed by size selection of RNA on a 15% 
UREA TBE Polyacrylamide gel (Invitrogen, EC6885BOX). Gels containing 
RNA with size between 15- to 27-nt were kept for small RNA library, while 
gels containing 28- to ^300-nt RNA were kept for PATH library. After gel 
elution, library construction for both sRNA and PATH was done using the 
lllumina TruSeq Small RNA Sample Preparation Kit (RS-200-0012), except 
that at the final size selection step, PCR products were separated on a 
6% TBE Polyacrylamide gel (Invitrogen, EC6265BOX) and selected for the 
range from 120- to ~1 ,000-bp for PATH library. Gel-eluted PCR products 
with different TruSeq index sequences were pooled and sent for lllumina 
sequencing. The mRNA library was constructed using the lllumina TruSeq 
RNA Sample Preparation Kit (RS-1 22-2001 ) according to the standard manual. 
PATH libraries were sequenced using either paired-end mode with length of 
readi being 120-bp and length of read2 being 30-bp (PEI 20+30), or single- 
end 100-bp (SE100); while sRNA and mRNA libraries were sequenced with 



single-end 50-bp (SE50). For BS-seq, DNA was isolated using the DNeasy 
Plant Mini Kit (QIAGEN #69104) according to manufacturer instructions and 
quantified using the Qubit dsDNA High Sensitivity Kit (Life Technologies 
#032851). Libraries were constructed with 30ng DNA using the Ovation 
Ultralow Methyl-Seq Library Systems (NuGEN #0335). Bisulfite conversion 
was done using the EpiTect Bisulfite Kit (QIAGEN # 59104). BS-seq Libraries 
were sequenced at a length of 50 bp. All sequencing was carried out on 
lllumina HiSeq machines at the Broad Stem Cell Research Center (BSCRC) 
sequencing core at University of California, Los Angeles. 

Real-Time RT-PCR and RNA Gel Blots 

Real-time RT-PCR to detect P4RNAs in various genotypes was performed as 
the following: Total RNA were extracted from 100 mg of flowers using Trizol 
(Ambion) with an extra step of 24:1 Chloroform: isoamyl alcohol to remove 
remaining phenol prior to isopropanol precipitation, the resuspended RNAs 
were cleaned up using Quick-RNA miniprep (Zymo research, USA) for further 
purification and complete gDNA removal. Then 5ug of the purified RNA from 
each sample was used for RT reaction with Superscript III first-strand kit 
(Invitrogen, USA). luL of the RT reaction was used for real-time PCR using 
iO SYBR green supermix (Biorad, USA). We used terminator exonuclease 
(Epicenter, USA) for the removal of RNA with 5'-monophosphate: 5ug of total 
RNA were treated for an hour at 30 degree with Terminator, and the control, 
non-treated, RNA with the same buffer and 50% glycerol instead of terminator 
exonuclease enzyme. After Terminator treatment the RNA were used for RT 
reaction as described above. Primer information can be found in Table S3. 

RNA gel blotting was performed as previously described (Henderson et al., 
2006) with LNA probes for detecting transposon regions and regular RNA 
probes for detecting 5S locus. Probe information can be found in Table S3. 

Pol II ChIP-Seq and AG04-RIP 

ChIP-seq was performed as described previously (Johnson et al., 2014). 5 ^ig 
Pol II antibodies (Abeam #ab817) were used for each ChIP. Libraries for Pol II 
ChIP-seq were generated using the Ovation Ultralow DR Multiplex System 
(NuGen #0330) and sequenced at a length of 50 bp. ChIP-seq data were visu- 
alized using ngsplot (Shen et al., 2014). 

AG04-RIP was performed as previously described (Ji et al., 201 1) with com- 
mercial AG04 antibody (Agrisera #AS09 61 7). RNA isolated from RIP was used 
for library construction with lllumina TruSeq Small RNA Sample Preparation Kit 
(RS-1 22-2001) according to the standard manual. 

Bioinformatic Analysis 
Data handling 

In general, qseq files received from the sequencing core were demultiplexed 
with an in-house Perl script and converted to fastq files for downstream anal- 
ysis. For small RNA and PATH data, original reads were first trimmed using 
Cutadapt (v1 .4), then mapped to the reference TAIR10 genome using Bowtie 
(Langmead et al., 2009) allowing only one unique hit (-m 1). We allowed zero 
mismatch for small RNA mapping (-v 0) and up to three mismatches for 
PATH mapping (-v 3). mRNA data were mapped using Tophat (Trapnell 
et al., 2009) allowing two mismatches and only one unique hit. 

Analysis of siRNAs and P4RNAs 

We used a list of Pol IV dependent siRNA loci that is previously described (Law 
et al., 2013) and manually inspected and filtered out a few loci that are tRNA 
related (listed in Table S2). In brief, these are 200 base pair bins where 
siRNAs were significantly reduced in a Pol IV mutant compared to two repli- 
cates of wild-type controls (FRD < 10“^°). More details can be found in the 
“Identification of siRNA clusters” section of Methods in the Law et al. paper 
(Law et al., 2013). Our definition of P4RNAs is any 27+ nt PATH reads derived 
from these ~7,000 previously defined Pol IV dependent 24-nt siRNA loci. 
For the abundance calculation, sRNA reads with the length between 18 and 
26 nt, and PATH reads with the length greater or equal to 27 nt were included 
(Table SI). 5' -Adenine preference was analyzed by calculating the nucleotide 
composition at each position (counting from 5'end) for Pol IV siRNAs and 
P4RNAS. Percentage of 3'-non-templated nucleotides in P4RNAs was calcu- 
lated at each position by analyzing the mapping results with a custom Perl 
script with the focus on the “MD:Z” column in sam format. The length of the 
3'-misincorporation was done using the seed mapping option of bowtie to 
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map the first 24 nt of P4RNA perfectly and then allow up to ten mismatches 
for 3' portion. This analysis confirmed that majority of P4RNAs carry one or 
two 3'-misincorporated nucleotides. Therefore we chose to allow three mis- 
matches for P4RNA mapping for all analysis. 

Normalization of P4RNA and sRNA Abundance 

Abundances of P4RNA in each library were normalized to the total number of 
sequenced reads, and abundances of sIRNAs were normalized to the sum of 
all miRNAs in each library. For example, in Figure 2B, in Col PATH library we 
obtained 161,241,523 reads in total and of which 63,739 were classified as 
sasRNA (27+ nt), while in dcl2/3/4 PATH library we obtained 89,226,121 reads 
in total and of which 4,283,063 were classified as sasRNA. Therefore the per- 
centage of P4RNA in dcl2/3/4 compared to Col is calculated as percentage = 
(4,283,063/89,226,121) 7(63,739/161 ,241 ,523) = 121.43 ^120-fold. 

BS-Seq Analysis and DMR Calling 

We used public data of floral ddml and wild-type BS-seq (GSE52346), ddml/ 
dcl3 sRNA-seq (GSE57191), and dcl3 sRNA-seq (GSE62801). Analysis of the 
floral BS-seq libraries of wild-type and ddm1 mutant (Creasey et al., 2014) 
were performed using BSMAP (Xi and Li, 2009), allowing only uniquely map- 
ped reads and discarded sibling PCR products, with the tolerance of 2 mis- 
matches per 50 bp. DMR calling was performed as previously described 
(Stroud et al., 2013), with a more stringent criteria: 1) sum of all sequenced 
cytosines in the 100 bp bin need to be at least 100; 2) the difference in CG 
methylation at each bin needs to be at least 0.5. This filtering allows us to focus 
on regions that are not only highly methylated in wild-type but also lose methyl- 
ation dramatically in the ddm1 mutant. CHG DMRs were filtered with similar 
criteria, with the minimal loss of CHG in ddm1 compared to wild-type being 
0.4, and the count of covered cytosines in each bin no less than 50. 
Nucleotide Composition and Enrichment 

P4RNAS from a defined set of loci were measured for their 3'-end sequence 
composition. For those P4RNAs that contain the 3' misincorporation, different 
compositions of the reference sequence at the first mismatched nucleotide 
(mono-nucleotide), the last matched plus first mismatched (di-nucleotide), or 
the last two matched plus first mismatched (tri-nucleotide) were counted as 
the observed value. The mono-, di- or tri-nucleotide compositions of the 
DMA sequence (or K-mer) were calculated using Jellyfish (Margais and King- 
sford, 201 1) and considered as expected value assuming P4RNAs end equally 
likely with any nucleotide. The observed value and expected value were both 
normalized to their own population, and the ratio of observed/expected is used 
to show the relative enrichment of certain mono-, di- or tri-nucleotide 
composition. 

ACCESSION NUMBERS 

The accession number for the sequencing data reported in this paper is GEO: 
GSE61439. 

SUPPLEMENTAL INFORMATION 

Supplemental Information includes seven figures and three tables and can be 
found with this article online at http://dx.doi.Org/10.1016/j.cell.2015.09.032. 

AUTHOR CONTRIBUTIONS 

J.Z., S.B., S.F., I.A, I.R.H., X.C., and C.T. performed experiments. T.-f.L, S.F., 
and J.G-B. provided materials. S.Y.P., L.L., and X.C. participated in the ge- 
netic experiments. S.E.J. and B.C.M. oversaw the study. J.Z., H.W., and 
W.L. analyzed data. J.Z. and S.E.J. designed the study and wrote the 
manuscript. 

ACKNOWLEDGMENTS 

We thank members of the Jacobsen lab for insightful discussions. We thank 
Molly Megraw for suggestions on TSS analysis and Mahnaz Akhavan for tech- 
nical assistance. We thank Dr. Jeffrey A. Long and Jared Sewell for their advice 
and guidance in performing ChIP-seq experiments and preparing genomic li- 
braries. J.Z. thanks Dr. Chomdao Chommy and Seksan Sapsubbsakul for 



encouragement. High-throughput sequencing was performed at the UCLA 
BSCRC BioSequencing Core Facility. This work was supported by NIH grant 
GM60398 to S.E.J and NSF award 1051576 to B.C.M. S.B. is supported by 
a postdoctoral fellowship of the Swiss National Science Foundation. J.G-B 
is a Human Frontiers Science Program fellow (LT000425/2012-L). J.Z. is a 
Life Science Research Foundation postdoctoral fellow, sponsored by the Gor- 
don and Betty Moore Foundation. S.E.J. is an Investigator of the Howard 
Hughes Medical Institute. 

Received: June 5, 2015 
Revised: August 14, 2015 
Accepted: September 1 1 , 2015 
Published: October 8, 2015 

REFERENCES 

Batista, P.J., Ruby, J.G., Claycomb, J.M., Chiang, R., Fahlgren, N., Kasschau, 

K. D., Chaves, D.A., Gu, W., Vasale, J.J., Duan, S., et al. (2008). PRG-1 and 
21U-RNAS interact to form the piRNA complex required for fertility in 
C. elegans. Mol. Cell 31, 67-78. 

Cokus, S.J., Feng, S., Zhang, X., Chen, Z., Merriman, B., Haudenschild, C.D., 
Pradhan, S., Nelson, S.F., Pellegrini, M., and Jacobsen, S.E. (2008). Shotgun 
bisulphite sequencing of the Arabidopsis genome reveals DNA methylation 
patterning. Nature 452, 215-219. 

Colome-Tatche, M., Cortijo, S., Wardenaar, R., Morgado, L., Lahouze, B., Sar- 
azin. A., Etcheverry, M., Martin, A., Feng, S., Duvernois-Berthet, E., et al. 
(2012). Features of the Arabidopsis recombination landscape resulting from 
the combined loss of sequence variation and DNA methylation. Proc. Natl. 
Acad. Sci. USA 109, 16240-16245. 

Creasey, K.M., Zhai, J., Borges, F., Van Ex, F., Regulski, M., Meyers, B.C., and 
Martienssen, R.A. (2014). miRNAs trigger widespread epigenetically activated 
siRNAs from transposons in Arabidopsis. Nature 508, 411-415. 

Cumbie, J.S., Ivanchenko, M.G., and Megraw, M. (2015). NanoCAGE-XL and 
CapFilter: an approach to genome wide identification of high confidence tran- 
scription start sites. BMC Genomics 16, 597. 

Gu, W., Lee, H.C., Chaves, D., Youngman, E.M., Pazour, G.J., Conte, D., Jr., 
and Mello, C.C. (2012). CapSeq and CIP-TAP identify Pol II start sites and 
reveal capped small RNAs as C. elegans piRNA precursors. Cell 151, 1488- 
1500. 

Haag, J.R., Ream, T.S., Marasco, M., Nicora, C.D., Norbeck, A.D., Pasa-Tolic, 

L. , and Pikaard, C.S. (2012). In vitro transcription activities of Pol IV, Pol V, and 
RDR2 reveal coupling of Pol IV and RDR2 for dsRNA synthesis in plant RNA 
silencing. Mol. Cell 48, 811-818. 

Han, B.W., Wang, W., Li, C., Weng, Z., and Zamore, P.D. (2015). Noncoding 
RNA. piRNA-guided transposon cleavage initiates Zucchini-dependent, 
phased piRNA production. Science 348, 817-821 . 

Havecker, E.R., Wallbridge, L.M., Hardcastle, T.J., Bush, M.S., Kelly, K.A., 
Dunn, R.M., Schwach, F., Doonan, J.H., and Baulcombe, D.C. (2010). The 
Arabidopsis RNA-directed DNA methylation argonautes functionally diverge 
based on their expression and interaction with target loci. Plant Cell 22, 
321-334. 

Henderson, I.R., Zhang, X., Lu, C., Johnson, L., Meyers, B.C., Green, P.J., and 
Jacobsen, S.E. (2006). Dissecting Arabidopsis thaliana DICER function in 
small RNA processing, gene silencing and DNA methylation patterning. Nat. 
Genet. 38, 721-725. 

Herr, A.J., Jensen, M.B., Dalmay, T., and Baulcombe, D.C. (2005). RNA poly- 
merase IV directs silencing of endogenous DNA. Science 308, 118-120. 

Ji, L, Liu, X., Yan, J., Wang, W., Yumul, R.E., Kim, Y.J., Dinh, T.T., Liu, J., Cui, 
X., Zheng, B., et al. (2011). ARGONAUTE10 and ARGONAUTE1 regulate the 
termination of fioral stem cells through two microRNAs in Arabidopsis. PLoS 
Genet. 7, el 001 358. 

Johnson, L.M., Du, J., Hale, C.J., Bischof, S., Feng, S., Chodavarapu, R.K., 
Zhong, X., Marson, G., Pellegrini, M., Segal, D.J., et al. (2014). SRA- and 



454 Cell 163, 445-455, October 8, 2015 ©2015 Elsevier Inc. 




Cell 



SET-domain-containing proteins link RNA poiymerase V occupancy to DNA 
methyiation. Nature 507, 124-128. 

Kanno, T., Huettei, B., Mette, M.F., Aufsatz, W., Jaiigot, E., Daxinger, L., Kreii, 
D.P., Matzke, M., and Matzke, A.J. (2005). Atypicai RNA poiymerase subunits 
required for RNA-directed DNA methyiation. Nat. Genet. 37, 761-765. 
Langmead, B., Trapneii, C., Pop, M., and Saizberg, S.L (2009). Uitrafast and 
memory-efficient aiignment of short DNA sequences to the human genome. 
Genome Biol. 10 , R25. 

Law, J.A., and Jacobsen, S.E. (2010). Establishing, maintaining and modifying 
DNA methyiation patterns in plants and animals. Nature rev. 11 , 204-220. 
Law, J.A., Vashisht, A.A., Wohlschlegel, J.A., and Jacobsen, S.E. (2011). 
SHH1, a homeodomain protein required for DNA methyiation, as weii as 
RDR2, RDM4, and chromatin remodeiing factors, associate with RNA poiy- 
merase iV. PLoS Genet. 7, el 0021 95. 

Law, J.A., Du, J., Haie, C.J., Feng, S., Krajewski, K., Paianca, A.M., Strahi, 

B. D., Patei, D.J., and Jacobsen, S.E. (2013). Polymerase IV occupancy at 
RNA-directed DNA methyiation sites requires SHH1. Nature 498 , 385-389. 
Lee, H.C., Gu, W., Shirayama, M., Youngman, E., Conte, D., Jr., and Meiio, 

C. C. (2012). C. elegans piRNAs mediate the genome-wide surveiiiance of 
germiine transcripts. Celi 150 , 78-87. 

Li, C.F., Henderson, I.R., Song, L., Fedoroff, N., Lagrange, T., and Jacobsen, 
S.E. (2008). Dynamic reguiation of ARGONAUTE4 within muitipie nuciear 
bodies in Arabidopsis thaiiana. PLoS Genet. 4 , e27. 

Li, S., Vandivier, L.E., Tu, B., Gao, L, Won, S.Y., Li, S., Zheng, B., Gregory, 

B. D., and Chen, X. (2015). Detection of Poi iV/RDR2-dependent transcripts 
at the genomic scaie in Arabidopsis reveais features and reguiation of siRNA 
biogenesis. Genome Res. 25, 235-245. 

Lister, R., O’Maiiey, R.C., Tonti-Fiiippini, J., Gregory, B.D., Berry, C.C., Miiiar, 
A.H., and Ecker, J.R. (2008). Highly integrated singie-base resolution maps of 
the epigenome in Arabidopsis. Ceii 133 , 523-536. 

Margais, G., and Kingsford, C. (2011). A fast, iock-free approach for efficient 
paraiiei counting of occurrences of k-mers. Bioinformatics 27, 764-770. 
Martin, C.T., and Coieman, J.E. (1989). T7 RNA poiymerase does not interact 
with the 5'-phosphate of the initiating nucieotide. Biochemistry 28 , 2760-2762. 
Matzke, M.A., and Mosher, R.A. (2014). RNA-directed DNA methyiation: an 
epigenetic pathway of increasing compiexity. Nature reviews 15 , 394-408. 
McCue, A.D., Panda, K., Nuthikattu, S., Choudury, S.G., Thomas, E.N., and 
Siotkin, R.K. (2015). ARGONAUTE 6 bridges transposabie eiement mRNA- 
derived siRNAs to the estabiishment of DNA methyiation. EMBO J. 34 , 20-35. 
Mi, S., Cai, T., Hu, Y., Chen, Y., Hodges, E., Ni, F., Wu, L, Li, S., Zhou, H., Long, 

C. , et ai. (2008). Sorting of smali RNAs into Arabidopsis argonaute complexes 
is directed by the 5' terminai nucieotide. Ceii 133 , 116-127. 

Mohn, F., Handler, D., and Brennecke, J. (2015). Noncoding RNA. piRNA- 
guided siicing specifies transcripts for Zucchini-dependent, phased piRNA 
biogenesis. Science 348 , 812-817. 

Nagano, H., Fukudome, A., Hiraguri, A., Moriyama, H., and Fukuhara, T. 
(201 4). Distinct substrate specificities of Arabidopsis DCL3 and DCL4. Nucieic 
Acids Res. 42 , 1845-1856. 

Nechaev, S., Fargo, D.C., dos Santos, G., Liu, L., Gao, Y., and Adelman, K. 
(2010). Giobai anaiysis of short RNAs reveais widespread promoter-proximai 
stalling and arrest of Poi ii in Drosophiia. Science 327 , 335-338. 



Onodera, Y., Haag, J.R., Ream, T., Costa Nunes, P., Pontes, O., and Pikaard, 
C.S. (2005). Piant nuciear RNA poiymerase iV mediates siRNA and DNA 
methylation-dependent heterochromatin formation. Ceii 120 , 613-622. 

Rentier, D., Yahubyan, G., Vega, D., Bulski, A., Saez-Vasquez, J., Hakimi, 
M.A., Lerbs-Mache, S., Coiot, V., and Lagrange, T. (2005). Reinforcement of 
siiencing at transposons and highiy repeated sequences requires the 
concerted action of two distinct RNA poiymerases iV in Arabidopsis. Genes 
Dev. 19 , 2030-2040. 

Ranjith-Kumar, C.T., Gutshaii, L., Kim, M.J., Sarisky, R.T., and Kao, C.C. 
(2002). Requirements for de novo initiation of RNA synthesis by recombinant 
flavivirai RNA-dependent RNA poiymerases. J. Virol. 76 , 12526-12536. 

Ream, T.S., Haag, J.R., Wierzbicki, A.T., Nicora, C.D., Norbeck, A.D., Zhu, 
J.K., Hagen, G., Guilfoyle, T.J., Pasa-Tolic, L, and Pikaard, C.S. (2009). Sub- 
unit compositions of the RNA-siiencing enzymes Pol IV and Pol V reveal their 
origins as speciaiized forms of RNA poiymerase ii. Moi. Ceii 33 , 192-203. 

Rountree, M.R., and Seiker, E.U. (1997). DNA methyiation inhibits eiongation 
but not initiation of transcription in Neurospora crassa. Genes Dev. 11 , 
2383-2395. 

Ruby, J.G., Jan, C., Piayer, C., Axteii, M.J., Lee, W., Nusbaum, C., Ge, H., and 
Bartei, D.P. (2006). Large-scaie sequencing reveals 21U-RNAs and additional 
microRNAs and endogenous siRNAs in C. eiegans. Ceii 127 , 1193-1207. 

Shen, L., Shao, N., Liu, X., and Nestier, E. (2014). ngs.piot: Quick mining and 
visuaiization of next-generation sequencing data by integrating genomic data- 
bases. BMC Genomics 15 , 284. 

Stroud, H., Greenberg, M.V., Feng, S., Bernatavichute, Y.V., and Jacobsen, 
S.E. (2013). Comprehensive anaiysis of siiencing mutants reveals complex 
regulation of the Arabidopsis methyiome. Ceii 152 , 352-364. 

Trapneii, C., Pachter, L., and Saizberg, S.L. (2009). TopHat: discovering spiice 
junctions with RNA-Seq. Bioinformatics 25, 1 1 05-1 111. 

Xi, Y., and Li, W. (2009). BSMAP: whoie genome bisuifite sequence MAPping 
program. BMC Bioinformatics 10 , 232. 

Xie, Z., Johansen, L.K., Gustafson, A.M., Kasschau, K.D., Leliis, A.D., Ziiber- 
man, D., Jacobsen, S.E., and Carrington, J.C. (2004). Genetic and functionai 
diversification of smaii RNA pathways in piants. PLoS Bioi. 2, El 04. 

Yamamoto, Y.Y., ichida, H., Matsui, M., Obokata, J., Sakurai, T., Satou, M., 
Seki, M., Shinozaki, K., and Abe, T. (2007). identification of piant promoter con- 
stituents by anaiysis of iocai distribution of short sequences. BMC Genomics 
8 , 67. 

Ye, R., Wang, W., iki, T., Liu, C., Wu, Y., Ishikawa, M., Zhou, X., and Qi, Y. 
(2012). Cytopiasmic assembiy and seiective nuciear import of Arabidopsis 
Argonaute4/siRNA complexes. Mol. Cell 46 , 859-870. 

Zheng, B., Wang, Z., Li, S., Yu, B., Liu, J.Y., and Chen, X. (2009). intergenic 
transcription by RNA poiymerase ii coordinates Pol IV and Pol V in siRNA- 
directed transcriptionai gene siiencing in Arabidopsis. Genes Dev. 23 , 2850- 
2860. 

Zhong, X., Du, J., Haie, C.J., Gaiiego-Bartoiome, J., Feng, S., Vashisht, A.A., 
Chory, J., Wohischiegei, J.A., Patei, D.J., and Jacobsen, S.E. (2014). Moiecu- 
iar mechanism of action of piant DRM de novo DNA methyitransferases. Ceii 
157 , 1050-1060. 



Cell 163, 445-455, October 8, 2015 ©2015 Elsevier Inc. 455 




Resource 



Cell 

Reconstruction and Simulation of Neocortical 
Microcircuitry 



Graphical Abstract 






Acquiring experimental data 



In silico reconstruction of cellular, and synaptic anatomy and physiology 



Simulation revealed a spectrum of activity states 



Authors 

Henry Markram, Eilif Muller, 

Srikanth Ramaswamy, 

Michael W. Reimann Javier DeFeiipe, 

Sean L Hiii, Idan Segev, Felix Schurmann 

Correspondence 

henry.markram@epfl.ch 

In Brief 

A digital reconstruction and simuiation of 
the anatomy and physioiogy of 
neocortical microcircuitry reproduces an 
array of in vitro and in vivo experiments 
without parameter tuning and suggests 
that celiuiar and synaptic mechanisms 
can dynamically reconfigure the state of 
the network to support diverse 
information processing strategies. 



Highlights 

• The Blue Brain Project digitaiiy reconstructs and simuiates a 
part of neocortex 

• Interdependencies aiiow dense in silico reconstruction from 
sparse experimentai data 

• Simulations reproduce in vitro and in vivo experiments 
without parameter tuning 

• The neocortex reconfigures to support diverse information 
processing strategies 



Markram et al., 2015, Cell 163, 456-492 
CrossMark October 8, 2015 ©2015 Elsevier Inc. 

http://dx.d 0 i. 0 rg/l 0.1 01 6/j.cell.201 5.09.029 



CelPress 



Cell 



Resource 



Reconstruction and Simulation 
of Neocortical Microcircuitry 

Henry Markram,^’2,i9* Eiiif Muller,^ Srikanth Ramaswamy,^’^^ Michael W. Reimann,^’^^ Marwan Abdellah,^ 

Carlos Aguado Sanchez,^ Anastasia Ailamaki,^® Lidia Alonso-Nanclares,®-^ Nicolas Antille,^ Selim Arsever, 

Guy Antoine Atenekeng Kahou,^ Thomas K. Berger, ^ Ahmet Bilgili,^ Nenad Buncic,^ Athanassia Chalimourda,^ 
Giuseppe Chindemi,^ Jean-Denis Courcol,^ Fabien Delalondre,^ Vincent Delattre,^ Shaul Druckmann,^-^ 

Raphael Dumusc,^ James Dynes, ^ Stefan Eilemann,^ Eyal Gal,^ Michael Emiel Gevaert,^ Jean-Pierre Ghobril,^ 

Albert Gidon,^ Joe W. Graham,^ Anirudh Gupta,^ Valentin Haenel,^ Etay Hay,^ ^ jhomas Heinis,^’^®’^^ Juan B. Hernando,^ 
Michael Hines,^^ yda Kanari,^ Daniel Keller,^ John Kenyon,^ Georges Khazen,^ Yihwa Kim,^ James G. King,^ 

Zoltan Kisvarday,^^ Pramod Kumbhar,^ Sebastian Lasserre,^’^^ Jean-Vincent Le Be,^ Bruno R.C. Magalhaes,^ 

Angel Merchan-Perez,®>^ Julie Meystre,^ Benjamin Roy Morrice,^ Jeffrey Muller,^ Alberto Muhoz-Cespedes,®’^ 

Shruti Muralidhar,2 Keerthan Muthurasa,^ Daniel Nachbaur,^ Taylor H. Newton,^ Max Nolte,^ Aleksandr Ovcharenko,^ 
Juan Palacios,^ Luis Pastor,^ Rodrigo Perin,^ Rajnish Ranjan,^ ^ \pr\a6 Riachi,^ Jose-Rodrigo Rodriguez,®’^ 

Juan Luis Riquelme,^ Christian Rossert,^ Konstantinos Sfyrakis,^ Ying Shi ,^’2 Julian C. Shillcock,^ Gilad Silberberg,^^ 
Ricardo Silva,^ Farhan Tauheed,^ ’^® Martin Telefont,^ Maria Toledo-Rodriguez,^^ Thomas Trankler, Werner Van Geit,^ 
Jafet Villafranca Diaz,^ Richard Walker,^ Yun Wang,^^’^^ Stefano M. Zaninetta,^ Javier DeFelipe,®’^’ 2 o Sean L. Hill,^’ 2 o 

Idan Segev,3’4,2o gucl Felix Schurmann^ ^o 

■■Blue Brain Project, Ecole polytechnique federale de Lausanne (EPFL) Biotech Campus, 1202 Geneva, Switzerland 
^Laboratory of Neural Microcircuitry, Brain Mind Institute, EPFL, 1015 Lausanne, Switzerland 

^Department of Neurobiology, Alexander Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem 91904, Israel 
"^The Edmond and Lily Safra Center for Brain Sciences, The Hebrew University of Jerusalem, Jerusalem 91904, Israel 
^Janelia Farm Research Campus, Howard Hughes Medical Institute, Ashburn, VA 20147, USA 

^Laboratorio Cajal de Circuitos Corticales, Centro de Tecnologia Biomedica, Universidad Politecnica de Madrid, 28223 Madrid, Spain 
^Institute Cajal (CSIC) and CIBERNED, 28002 Madrid, Spain 

^CeSViMa, Centro de Supercomputacion y Visualizacion de Madrid, Universidad Politecnica de Madrid, 28223 Madrid, Spain 
^Modeling and Virtual Reality Group, Universidad Rey Juan Carlos, 28933 Mostoles, Madrid, Spain 

■■OKey Laboratory of Visual Science and National Ministry of Health, School of Optometry and Opthalmology, Wenzhou Medical College, 
Wenzhou 325003, China 

■'■'Caritas St. Elizabeth’s Medical Center, Genesys Research Institute, Tufts University, Boston, MA 02111, USA 

■■^Department of Neurobiology, Yale University, New Haven, CT 06510 USA 

■■^MTA-Debreceni Egyetem, Neuroscience Research Group, 4032 Debrecen, Hungary 

■■^School of Life Sciences, University of Nottingham, Nottingham NG7 2UH, United Kingdom 

■■^Laboratoire d’informatique et de visualisation, EPFL, 1015 Lausanne, Switzerland 

■■^Data-lntensive Applications and Systems Lab, EPFL, 1015 Lausanne, Switzerland 

■■^Imperial College London, London SW7 2AZ, UK 

■■^Department of Neuroscience, Karolinska Institutet, Stockholm 17177, Sweden 
■■^Co-first author 
2°Co-senior author 

*Correspondence: henry.markram@epfl.ch 
http://dx.d 0 i. 0 rg/l 0.101 6/j.cell.201 5.09.029 



SUMMARY 

We present a first-draft digital reconstruction of the 
microcircuitry of somatosensory cortex of juvenile 
rat. The reconstruction uses cellular and synaptic 
organizing principles to algorithmically reconstruct 
detailed anatomy and physiology from sparse experi- 
mental data. An objective anatomical method defines 
a neocortical volume of 0.29 ± 0.01 mm^ containing 
~31,000 neurons, and patch-clamp studies identify 
55 layer-specific morphological and 207 morpho- 
electrical neuron subtypes. When digitally recon- 
structed neurons are positioned in the volume and 
synapse formation is restricted to biological bouton 
densities and numbers of synapses per connection, 



their overlapping arbors form million connections 
with --37 million synapses. Simulations reproduce 
an array of in vitro and in vivo experiments without 
parameter tuning. Additionally, we find a spectrum 
of network states with a sharp transition from synchro- 
nous to asynchronous activity, modulated by physio- 
logical mechanisms. The spectrum of network states, 
dynamically reconfigured around this transition, sup- 
ports diverse information processing strategies. 

INTRODUCTION 

Since Santiago Ramon y Cajal’s seminal work on the neocortex 
(DeFelipe and Jones, 1988; Ramon y Cajal, 1909, 1911), avast 
number of studies have attempted to unravel its multiple levels 



456 Cell 163, 456-492, October 8, 2015 ©2015 Elsevier Inc. 



CrossMark 




Cell 



of anatomical organization (types of neurons, synaptic connec- 
tions, layering, afferent and efferent projections within and be- 
tween neocortical regions, etc.) and functional properties 
(neuronal response characteristics, synaptic responses and 
plasticity, receptive fields, functional neocortical columns, emer- 
gent activity maps, interactions between neocortical regions, 
etc.). However, there are still large gaps in our knowledge, espe- 
cially concerning the anatomical and physiological organization 
of the neocortex at the cellular and synaptic levels. 

Specifically, while neurons have been classified in terms of 
their electrophysiological behaviors (Connors and Gutnick, 
1990; Kasper et al., 1994; McCormick et al., 1985), expression 
of different calcium-binding proteins and neuropeptides (Cello, 
1986; DeFelipe, 1993; Gonchar and Burkhalter, 1997; Kawagu- 
chi and Kubota, 1997; Toledo-Rodriguez et al., 2005) and 
morphological features (Kisvarday et al., 1985; Larkman, 
1991a; Tamas et al., 1998; Wang et al., 2002), there is still no 
consensus on an objective and comprehensive classification of 
neuron types. Although the distribution of protein and genetic 
markers for different neurons (Grange et al., 2014; Hendry 
et al., 1989; Kawaguchi and Kubota, 1997; Meyer et al., 2002; 
Toledo-Rodriguez et al., 2004) and the relative proportions of 
some morphologically and electrically classified neurons (Beau- 
lieu and Colonnier, 1983; Cauli et al., 1997; Hendry et al., 1984; 
Meyer et al., 2010a; Rudy et al., 2011) have been described, 
we lack a comprehensive view of the number of each type of 
neuron in each layer. Since the advent of paired recording tech- 
niques, several studies have characterized the anatomical and 
physiological properties of synaptic connections between 
some types of neurons (Cobb et al., 1997; Feldmeyer et al., 
1999; Frick et al., 2008; Gupta et al., 2000; Mason et al., 1991; 
Reyes et al., 1998; Thomson et al., 1993), but a large proportion 
have yet to be studied. Although labeling with retrograde and 
anterograde tracers and trans-synaptic viral vectors, imaging 
with array tomography, and saturated reconstruction with elec- 
tron microscopy have made it possible to begin mapping pre- 
and postsynaptic neurons for individual neocortical neurons 
(Boyd and Matsubara, 1991; Callaway, 2008; Glenn et al., 
1982; Kasthuri et al., 2015; Killackey et al., 1983; Micheva and 
Smith, 2007; Micheva et al., 2010; Wickersham et al., 2007), 
we know neither the numbers and types of the pre- and postsyn- 
aptic neurons associated with any specific neuron type nor the 
numbers and locations of the synapses that they form with their 
immediate neighbors. 

At a functional level, there have been many investigations of 
emergent behavior in neocortical slices (Cunningham et al., 
2004; Mao et al., 2001; McCormick et al., 2003; Sanchez-Vives 
and McCormick, 2000; Yuste et al., 1997), correlated activity 
(Hasenstaub et al., 2005; Livingstone, 1996; Salinas and Sej- 
nowski, 2001; Shu et al., 2003; Silberberg et al., 2004; Singer, 
1993), and the functional impact of individual neurons across 
cortical layers (Sakata and Harris, 2009; Schroeder and Foxe, 
2002; Silva et al., 1991; Steriade et al., 1993), as well as in vivo 
activity in somatosensory and other cortical areas (Chen et al., 
2015; Klausberger et al., 2003; Leinekugel et al., 2002; Luczak 
et al., 2007; Reyes-Puerta et al., 201 5; Wilson et al., 201 2), How- 
ever, we still lack an understanding of the cellular and synaptic 
mechanisms and the role of the different layers in the simplest 



of behaviors, such as correlated and uncorrelated single-neuron 
activity and, more generally, synchronous and asynchronous 
population activity. For example, it is known that different types 
of neurons are connected through synapses with different dy- 
namics and strengths, strategically positioned at different loca- 
tions on the neurons’ dendrites, somata, and axons, but the 
functional significance of this organization remains unclear. 
Computational approaches that abstract away this level of bio- 
logical detail have not been able to explain the functional signif- 
icance of such intricate cellular and synaptic organization. 
Although future experimental research will undoubtedly 
advance our knowledge, it is debatable whether experimental 
mapping alone can provide enough data to answer these 
questions. 

Here, we present a complementary algorithmic approach that 
reconstructs neuronal microcircuitry across all layers using avail- 
able sparse data and that leverages biological principles and in- 
terdependencies between datasets to predict missing biological 
data. As a test case, we digitally reconstructed a small volume of 
tissue from layers 1 to 6 of the hind-limb somatosensory cortex 
of 2-week-old Wistar (Han) rat. This model system was chosen 
not only because it is one of the most comprehensively charac- 
terized in the neocortex, but also because experimental data on 
its cellular and synaptic organization are readily available and 
validation experiments are relatively easy to perform. In brief, 
we recorded and digitally reconstructed neurons from in vitro 
brain slices and classified the neurons in terms of well-estab- 
lished morphological types (m-types; Figure 1A), positioned 
the neurons in a digital volume of objectively defined dimensions 
according to experimentally based estimates of their layer spe- 
cific densities (Figure IB), and reconstructed the connectivity 
between the neurons (Figure 1C). Neurons were then classified 
into electrical types (e-types), using an extended version of the 
classification proposed in the Petilla convention (Ascoli et al., 
2008), and models were produced that captured the character- 
istic electrical behavior of each type. (Figure 1 D); similarly, syn- 
apses were modeled to capture the characteristic synaptic dy- 
namics and kinetics of particular synapse types (s-types; 
Figure IE). Finally, we constructed a virtual slice and recon- 
structed thalamic input using experimental data (Figure IF; 
Meyer et al., 2010b). 

This approach yielded a first-draft digital reconstruction of 
the microcircuitry, which was validated against a multitude of 
experimental datasets not used in the reconstruction. The results 
suggest that it is possible to obtain dense maps of neural micro- 
circuitry without measuring every conceivable biological param- 
eter and point to minimal datasets required, i.e., strategic data. 
Integrating complementary, albeit sparse, datasets also makes 
it possible to reconcile discrepancies in the literature, at least 
partially addressing the problem of data quality and reproduc- 
ibility. Simulations exploring some of the emergent behaviors 
of the reconstructed microcircuitry reproduce a number of 
previous in vitro and in vivo findings and provide insights into 
the design and functioning of neocortical microcircuitry. The 
experimental data, the digital reconstruction, and the simulation 
results are available at the Neocortical Microcircuit Collaboration 
Portal (NMC Portal; https://bbp.epfl.ch/nmc-portal; see Ram- 
aswamy et al., 2015). 
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Figure 1 . Workflow for Data-Driven Reconstruction of Neocortical Microcircuitry 

(A) Morphological diversity of neurons, (a) Identify the morphological diversity in the neocortical microcircuit (m-types). (b) Repair and then clone the various m- 
types with statistical variations to enrich the number of exemplars. 

(B) Microcircuit anatomy, (a) Define the spatial dimensions of a unitary microcircuit, (b) Assemble individual neurons in 3D space according to the frequency of 
occurrence of each m-type per layer, selecting the appropriate m-type instance that satisfies laminar constraints on the axonal and dendritic distribution. 

(C) Reconstructing microcircuit connectivity. Derive the number and location of synaptic contacts formed between all neurons in the microcircuit, based on a 
series of synaptic connectivity rules. 

(D) Electrical diversity of neurons. Map and model the electrical types (e-types) of each m-type to account for the observed diversity of morpho-electrical 
subtypes (me-types). 

(E) Synaptic diversity of neurons. Map and model the diversity of synaptic types (s-types) observed between pre-post combinations of me-types, according to 
rules derived from synaptic physiology. 

(F) Reconstructing virtual tissue volumes. Apply the above strategy to reconstruct defined circuit volumes (microcircuits, slices, mesocircuits) for in silico ex- 
periments; insert synapses formed by thalamocortical fibers for stimulation experiments. 



RESULTS 

Neuron-type Nomenclature 

Neurons differ in terms of their location in the brain, morphology, 
electrical properties, projections, and the genes and proteins 
that they express (for reviews, see Harris and Shepherd, 2015; 
Markram et al., 2004). The combination of these properties 
implies an immense diversity of neuron types. Given the lack of 
sufficient data for other dimensions, the neuronal classification 
used for this first-draft digital reconstruction considered only 
layer, local morphology, and electrophysiology. Naming of 
morphological types was based on the most common names 
used over the past century (Connors and Gutnick, 1990; DeFe- 
lipe, 1993; DeFelipe et al., 2013; Douglas and Martin, 2004; 



Fairen et al., 1984; Hestrin and Armstrong, 1996; Kawaguchi 
and Kubota, 1997; Kisvarday et al., 1985; Oberlaender et al., 
2012; Somogyi et al., 1982, 1998; Svoboda et al., 1997; Szaba- 
dics et al., 2006), extended with a layer prefix (e.g., Layer_Mor- 
phology, L5_MC for layer 5 Martinotti cells). Electrical types, 
based on the Petilla convention (Ascoli et al., 2008), were treated 
as subtypes, (e.g., L5_MC_NAC for the non-accommodating 
subtype; see Experimental Procedures). When whole-brain 
axonal tracing data for a sufficient number of projecting neurons 
becomes available (e.g., L5_TTPC_CP and L5_TTPC_CT to 
represent cortico-pontine and cortico-tectal subtypes; Hallman 
et al., 1 988; Wang and McCormick, 1 993; for a review, see Ram- 
aswamy and Markram, 201 5), the proposed classification can be 
extended to include projection subtypes. Similarly, when there 
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Large Basket Cell 
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Small Basket Cell 
Chandelier Cell 



Pyramidal Cell 
Star Pyramidal Cell 
Spiny Stellate Cell 



Thick-tufted Pyramidal Cell with 
a late bifurcating apical tuft 
Thick-tufted Pyramidal Cell with 
an early bifurcating apical tuft 
Untufted Pyramidal Cell 
Slender-tufted Pyramidal Cell 
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Tufted Pyramidal Cell with 
dendritic tuft terminating in layer 1 
Pyramidal Cell with inverted 
apical-like dendrites 
Pyramidal Cell with bipolar 
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Figure 2. Table of Neocortical Neuronal Morphologies 

Exemplar 3D reconstructions of 55 m-types. Morphologies in L2 and L3 are not separated. Axon in blue, dendrites in red. Full morphologies are not always shown. 
See also Figure S1 for average arbor densities of each m-type and Figure S2 for objective classification of m-types and details of the morphology cloning process. 
See also Movie S1A. 



are sufficient single-cell gene and protein expression data to 
systematically identify cells, it can be extended to include molec- 
ular subtypes. The abbreviations used for each m-type are 
provided in Figure 2. A mapping between the nomenclature 
used in this study and alternative names present in the literature 
is provided in Table 1 . 

Morphological Diversity of Neocortical Neurons 

We recorded and labeled >14,000 neurons from all six layers in 
the somatosensory cortex of PI 4 male Wistar (Han) rats, using 
patch-clamp electrodes in in vitro slices. Of these neurons, 
2,052 were sufficiently well stained to allow expert classification 
into m-types, based on well-established characteristic features 
of their dendritic and axonal arbors, a procedure initiated by early 
neuroanatomists and still in use today (Fairen et al., 1984; Kara- 
giannis et al., 2009; Karube et al., 2004; Kawaguchi and Kubota, 
1997; Kisvarday et al., 1985; Larkman, 1991a; Perrenoud et al., 
2013; Peters and Kaiserman-Abramof, 1970; Ramon y Cajal, 
1 909, 1911; Somogyi et al., 1 982, 1 998; Wang et al., 2004; Yuste, 
2005). We were able to digitally reconstruct a subset of 1 ,009 of 
these neurons. This allowed validation of the expert classifica- 
tion using an objective method (see below) based on clustering 
of characteristic features and provided the initial pool of digital 
neuron models needed to reconstruct the microcircuitry. In a 
few cases, we had no morphological reconstructions for rare 
m-types known to be present in the microcircuitry (L5_BP, 
L5_ChC, L6_NGC; Olah et al., 2007; Szabadics et al., 2006). 
These were represented using exemplars of the same mor- 
phology from neighboring layers. Although L6 horizontal and 



sub-plate pyramidal cells (L6_HPC and L6_SPC) were present 
in the dataset and have also been reported in the literature 
(Ghosh and Shatz, 1993; Hevner et al., 2001), the quality of the 
stains was not sufficient for reliable reconstruction. These mor- 
phologies are not represented in the first draft. 

Aggregating morphological reconstructions and reports in the 
literature, we distinguished 55 m-types (65 if layers 2/3 are 
considered separately and 67 if L6_HPC and L6_SPC are also 
considered; Figure 2). Inhibitory types are mostly distinguished 
by axonal features and excitatory types by dendritic features 
(for reviews, see Markram et al., 2004; Ramaswamy and Mark- 
ram, 201 5; Spruston, 2008). Figure SI shows overlays of multiple 
exemplars of each of the 55 major m-types, and Figures S2A and 
S2B illustrate the objective classification. While in some cases, 
it might have been possible to introduce a finer separation be- 
tween m-types, this would have limited the size of the samples 
for individual types, reducing the reliability of the classification. 

The same inhibitory types were present in all layers except 
layer 1 , which contained a unique set of inhibitory neuron types. 
Pyramidal cell morphologies varied across layers (Figure 2, right) 
and also with depth within layer, as illustrated by the diversity of 
L23_PCs (Figure 2, upper-right). The number of pyramidal cell 
types, as defined by their local morphology, increased from 
upper to lower layers. Several types of interneurons (e.g., LBC 
and DBC) had axonal arbors that tended to descend to deeper 
layers when they were in upper layers and to ascend to upper 
layers when they were in deeper layers. Consistent with this 
trend, one type of pyramidal cell (L6_IPC) also had inverted 
axonal arbors. 
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Table 1. Relation of Interneuron Classes to Classification Schemes Found in the Literature 



Morphological 

Type 


Neurogliaform 
Cell (NGC) 


Small Basket 
Cell (SBC) 


Double Bouquet 
Cell (DBG) 


Bipolar Cell (BP) 


Martinotti Cell (MC) 


Bitufted Cell (BTC) 


Large Basket 
Cell (LBC) 


Nest Basket 
Cell (NBC) 


Chandelier 
Cell (ChC) 


Other 

morphological 

classifications 


Dwarf cell, 
button-type cell 


Clutch cell 


Bitufted cell/ 
interneuron, 
horse-tail cell 


Bitufted cell/ 
interneuron 


Bitufted cell/ 
interneuron 


Bitufted 

interneuron 


Common 
basket cell, 
typical 
basket cell 


Willow cell, 
arcade cell, 
shaft-biased 
cell, atypical 
basket cell 


Axo-axonic cell 


Predominantly 
expressed 
Ca^^-binding 
proteins 
and peptides 


CB(-), NPY(+), 
PV (-), VIP (-), 
CR (-) SOM (-) 


CB(++), NPY(+), 
PV (-), VIP (+++), 
CR (-) SOM (++) 


CB(+), NPY(-), 
PV(-), VIP (+++), 
CR(+) SOM(++) 


CB(-), NPY(-), 
PV (-), VIP (+++), 
CR(++) SOM(++) 


CB(++), NPY(++), 
PV (-), VIP (-), 

CR (-) SOM (+++) 


CB(++), NPY(+), 
PV (-), VIP (+), 
CR (++) SOM (++) 


CB (++), NPY (+), 
PV (+++), VIP(+), 
CR (+) SOM (-) 


CB (++), NPY (+), 
PV (+++), VIP(+), 
CR (++) SOM (-) 


CB(+), NPY(-), 
PV(-), VIP(-), 
CR(-) SOM(-) 


Electrical 

types 


bNAC (7%), 
cNAC (79%), 
cSTUT (7%), 
cAC (7%) 


bNAC (36%), 
cAC (36%), 
dNAC (29%) 


bAC cAC 
(9%), (9%), 

bIR cIR 

(37%), (18%), 
bNAC cNAC 
(9%), (9%) 

bSTUT 
(9%) 


bAC cAC 

(7%), (29%), 

bIR cNAC 

(14%), (14%), 

bNAC dSTUT 
(29%) (7%) 


bAC cNAC 

(37%), (3%), 

bIR cSTUT 

(11%), (3%), 

bSTUT dNAC 
(4%), (3%) 

cAC 
(37%) 


bAC 

(17%), 

cAC 

(67%), 

cNAC 

(17%) 


bAC cNAC 

(6%), (17%), 

cAC dNAC 

(12%), (17%), 

cIR dSTUT 

(6%) (24%) 


bAC cIR 

(6%), (7%), 

bIR cNAC 

(6%), (20%), 

bSTUT cSTUT 

(13%), (20%), 

cAC dSTUT 

(20%) (7%) 


cAC 

(38%), 

cNAC 

(38%), 

dNAC 

(25%) 


Other 

electrical 

classifications 


Non-fast spiking, 
late spiking 


Fast spiking, 
non- 
accommodating, 
non-adapting 


Irregular spiking, 
regular spiking 
non-pyramidal, 
adapting 


Late spiking, 
regular spiking 
non-pyramidal, 
adapting 


Regular spiking 
non-pyramidal, 
burst spiking 
non-pyramidal, 
low threshold 
spiking 


Regular spiking 
non-pyramidal, 
adapting, 
burst spiking 
non-pyramidal 


Fast spiking, 
non- 
accommodating, 
non-adapting 


Fast 

spiking, non- 
accommodating, 
non-adapting 


Fast spiking, 
late spiking, 
non-adapting 



The terms used in this paper are in the first row, followed by other common names in the literature. Interneurons can be categorized according to which primary marker they express (calcium- 
binding proteins: parvalbumin [PV], calbindin [CB], and calretinin [CR]; neuropeptides: somatostatin [SOM], vasoactive intestinal polypeptide [VIP], neuropeptide Y [NPY], and cholecystokinin 



[CCK]). The mapping to serotonergic receptors (5HT3 aR) is not included since this was not assayed in the RT-PCR. We assign several possible electrical types to each morphological type, based 
on the Petilla convention, and show other names frequently used in the literature. See Figure 4 for definitions of electrical types. 
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Figure 3. Neuron Densities and Composition and Microcircuit Dimensions. 

(A) Neuron densities and numbers. Verticai thicknesses as determined by transitions in neuronai somata size and density in NeuN stained siices (six animais; 
mean + SD). Neuron densities and numbers (six animais; mean ± SD). 

(B) Neuron fractions. Confocai biock imaging of duai immunohistochemicai iabeiing. DAPi iabeis aii ceiis (biue). NeuN iabeis aii neurons (green), GABA iabeis aii 
GABAergic ceiis inciuding giia (red), duai GABA and NeuN iabeis oniy GABAergic neurons (green). Bars to the right show fractions of excitatory (red) and inhibitory 
(biue) neurons in each iayer. 

(C) m-type composition. Fractions of inhibitory (ieft) and excitatory (right) m-types per iayer (n = 2052). 

(D) Dimensions. The horizontai dimension was defined as the smaiiest circie required to attain maximai dendritic voiume at a centrai minicoiumn (brown, top); cut- 
off radius, 95% of the piateau voiume (r = 21 0 |am, middie). To aiiow tiiing, the circie was transformed into a hexagon, preserving the area. For m-type acronyms, 
see Figure 2. 

See aiso Figure S3 for detaiis on morphoiogy piacement and Figure S4 for vaiidation of the composition. See aiso Movie SI B. 



Using multiple exemplars obtained from different animals for 
each m-type, we developed a repair process to recover arbors 
cut during the slicing process, which was validated using in vivo 
reconstructed neurons (see Experimental Procedures; Anwar 
et al., 2009). To generate an even larger pool of unique morphol- 
ogies, we cloned multiple exemplars of each m-type (Figures 
S2C-S2F), jittering branch angles, and section lengths in the 
clones (see Experimental Procedures). The morphometric 
properties of the resulting population were validated against 
distributions of features obtained from reconstructed neurons 
(see Experimental Procedures). This approach allowed us to 
establish a dataset of neuronal morphologies (see Movie S1A) 
that respects biological variability. Software applications for 
repairing and cloning in vitro neuron morphologies and for auto- 
mated classification of neurons into the 55 m-types are available 
through the NMC Portal. 

Reconstructing Neuron Densities, Ratios, and 
Composition 

Reconstruction began by specifying the dimensions of the micro- 
circuit, the fractions of excitatory and inhibitory neurons, the 
proportions of each m-type, and the number of neurons of each 



m-type. The height of the neocortex and heights of each layer 
were measured experimentally in six animals, yielding an average 
overall height of 2,082 ± 80 microns (mean ± SD; n = 6; Figure 3A). 
Layer thicknesses were determined experimentally by measuring 
the location of transitions in cell densities and soma sizes in 
NeuN-stained tissue blocks (see Experimental Procedures). Frac- 
tions of excitatory and inhibitory neurons per layer (E-l fractions) 
were established by counting cells stained for DAPI (all cells), 
NeuN (all neurons), and GABA (all inhibitory neurons) in tissue 
blocks (Figure 3B; see Experimental Procedures). Overall, excit- 
atory and inhibitory neurons represented 87% ± 1 % and 13% ± 
1 % of the population, respectively, with a trend toward higher 
fractions of excitatory neurons in deeper layers (Figure 3B). 

The m-type composition for all excitatory and all inhibitory 
neurons in each layer was obtained from the relative frequencies 
of each m-type in the experimental dataset of 2,052 classified 
neurons mentioned earlier (Figure 3C; see Experimental Proce- 
dures). It is not possible to exclude sampling bias in this dataset. 
However, since E-l fractions were obtained in an unbiased 
manner, any bias is restricted to the proportions of m-types 
within the excitatory and inhibitory neurons and does not affect 
the overall E-l balance. 
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The E-l fractions and m-type composition determined in this 
way are broadly consistent with previous reports (DeFelipe 
et al., 2002; Lefort et al., 2009). For example, it is well established 
that ~50% of inhibitory interneurons are basket cells (i.e., LBCs 
and NBCs— predominantly parvalbumin-positive cells; SBCs— 
predominantly vasoactive intestinal peptide (VlP)-positive cells; 
we found ~53%, see below), that Martinotti cells (i.e., predomi- 
nantly somatostatin-positive cells; we found ~22%, see below) 
are frequent in all layers except L1 , and that bitufted and bipolar 
cells (i.e., many of the calbindin and calretinin-positive cells) 
and double bouquet cells (i.e., many of the VIP-positive cells) 
are both found in layers 2-6. Other inhibitory interneuron types 
are also found in L2-L6 but less frequently (Kawaguchi and 
Kubota, 1997; Khmer et al., 2005; Meyer et al., 2011; Olah 
et al., 2007; Sancesario et al., 1998; Somogyi et al., 1998; for a 
review, see Markram et al., 2004). Previously published neuron 
densities could not be used because they varied by a factor of 
two (40,000-80,000 neurons/mm^; Beaulieu, 1993; Cragg, 
1967; DeFelipe et al., 2002; Keller and Carlson, 1999; Peters, 
1987) and are too low to account for the number of synapses 
in the microcircuit (see below, “Digital Reconstruction of 
Connectivity”). We therefore performed new experiments, 
counting cells in NeuN-stained tissue blocks. The experiments 
yielded a mean cell density of 108,662 ± 2,754 neurons/mm^ 
(mean ± SEM, n = 6; see Experimental Procedures), comparable 
to observations in rat barrel cortex (Meyer et al., 2010a). Neuron 
densities were highest in L4 (Figure 3A), consistent with previous 
studies (Meyer et al., 2010a). 

Since hind-limb somatosensory cortex, unlike barrel cortex, 
has no anatomically defined horizontal columnar organization 
(Horton and Adams, 2005; Markram, 2008), we chose to define 
the radius of the microcircuit by placing reconstructed neurons 
in a cylindrical volume and determining the minimal radius where 
the density of dendrites saturates at the center (Figure 3D; 95% 
of the plateau value obtained at a radius of 210 |im; see Experi- 
mental Procedures). We chose dendrites, as opposed to axons, 
because they only arborize locally. This convention, which yields 
a minimal radius that reflects saturated dendritic density along 
the central axis, could allow comparisons between microcircuits 
in different brain regions. It yields a radius similar to the horizontal 
extent of the dendrites of the largest neuron in the microcircuit 
(i.e., the L5_TTPC; for a review, see Ramaswamy and Markram, 
201 5) and is comparable with the dimensions of the barrels in the 
rodent barrel cortex (Meyer et al., 2010b; Wimmer et al., 2010). 
To allow tiling of multiple microcircuits while minimizing edge ef- 
fects, the volume of the microcircuit was defined as a hexagonal 
prism (Figure 3D, bottom) with a cross-sectional area equal to 
that of the circle with the radius defined above and a height 
determined by the combined height of the layers. 

With these densities, m-type composition, and circuit dimen- 
sions, we calculated the number of each m-type in each layer 
and in the whole microcircuit. To approximate inter-individual 
variation in layer dimensions and neuronal densities, we digitally 
reconstructed separate microcircuits corresponding to layer 
heights and densities measured in five animals (Bio1-Bio5). 
The five reconstructions had an average of 31 ,375 ± 2,251 neu- 
rons (mean ± SD, n = 5), with the number of neurons increasing in 
each layer from LI to L6. We then constructed an additional 



microcircuit using the averaged data (BioM). To assess the vari- 
ation introduced by the digital reconstruction process (stochas- 
tic variations in m-type composition, selection and positioning 
of model neurons, and synaptic connectivity [see below]), we 
reconstructed seven instances of each microcircuit (i.e., seven 
reconstructions each from Bio1-Bio5 and seven from BioM; 42 
in total). 

Positioning Morphologically Reconstructed Neurons 

After establishing the dimensions of the microcircuit and the 
number of neurons belonging to each m-type in each layer, it 
was necessary to position each neuron in the digital reconstruc- 
tion. Consistent with reports of weak minicolumnar organization 
in rodents, (Mountcastle, 1998), neurons were arranged in 310 
minicolumns at horizontal positions drawn from 2D Gaussians 
around the center of each minicolumn, thus relaxing the strict- 
ness of the minicolumnar organization (see Experimental Proce- 
dures). The positions of the neurons along the vertical axis of the 
minicolumn were randomly chosen within each layer, using a 
space-filling algorithm to ensure that somata did not overlap 
(see Experimental Procedures). 

Once the positions of the neurons were established, a second 
algorithm randomly selected a suitable morphology for each po- 
sition from the top 8% of morphologies, scored by their match to 
typical patterns of arborization within and across layers (Fig- 
ure S3; see Experimental Procedures). These patterns were 
manually annotated on each reconstructed neuron, based on 
the depth of the recorded neuron within each layer and cross- 
layer arborization patterns described in the literature (see Exper- 
imental Procedures and NMC Portal). Figure S4A illustrates the 
microcircuit at this stage of reconstruction (see also Movie 
SI B). The total lengths of axons and dendrites in the average 
microcircuit were 350 ± 4 m and 215 ± 3 m (mean ± SD, n = 7), 
respectively. 

Biological accuracy at this stage of the reconstruction was 
validated against two experimental datasets that had not been 
used thus far. The first tissue-level dataset provides in vitro 
immunohistochemical staining of 30 |im sections for seven 
markers (calcium-binding proteins and neuropeptides) com- 
monly used to label inhibitory interneurons (Figure S4B). The 
second cellular-level dataset provides estimated probabilities 
that the genes for these markers are expressed in specific 
m-types (Toledo-Rodriguez et al., 2005; Wang et al., 2002, 
2004). We used the second dataset to add the markers to the 
model neurons. We then performed in silico immunohistochem- 
ical staining of the whole reconstructed tissue for each marker 
separately and compared the in silico stains against immunohis- 
tochemical stains from the first dataset. Although gene expres- 
sion data are noisy and genes do not translate equally to protein 
levels, we found a reasonable correspondence between the 
numbers of neurons at different depths stained for specific 
markers in the in silico and the in vitro stains (regression, r = 
0.65; Figure S4C). Furthermore, the layer-dependent pattern of 
in silico stained cells was consistent with previous staining ex- 
periments in this brain region (Ascoli et al., 2008; Conde et al., 
1994; DeFelipe, 1993; Dumitriu et al., 2007; Gentet et al., 2010, 
2012; Gonchar and Burkhalter, 1997; Gonchar et al., 2007; 
Kawaguchi and Kondo, 2002; Kawaguchi and Kubota, 1993, 
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Figure 4. Table of Morpho-Electrical Neuron Types 

(A) e-types. Diverse firing patterns in response to depoiarizing step current injections in neocorticai neurons, c, continuous; d, deiayed; b, bursting. AC, 
accommodating; NAC, non-accommodating; STUT, stuttering, iR, irreguiar; AD, adapting. 

(B) An exempiar neuron (L23NBC) with a diversity of e-types. Percentages indicate the reiative frequency of e-type occurrence. 

(C) Fractions of e-types (1 1 e-types) recorded experimentaiiy in each of the 55 m-types, making up 207 me-types. Soiid iines indicate iayer boundaries. SeeTabie 
1 for reiation of e-types to other ciassifications in the iiterature. 



1997; McGarry et al., 2010; O’Connor et al., 2009; Packer and 
Yuste, 2011; Santana et al., 2013; see also NMC portal). The 
observed correspondence would be unlikely in the presence of 
major errors in neuron densities, m-type composition, or posi- 
tioning of reconstructed neurons. However, the biological data 
are highly variable, and the validation of the inhibitory m-type 
composition used only a small proportion of markers reported 
in the literature. The reconstruction should thus be considered 
as a first draft, to be refined as it is challenged with additional 
markers. 

Morpho-Electrical Composition 

We applied a standardized battery of stimulation protocols 
(Le Be et al., 2007; Wang et al., 2002, 2004) to >3,900 neurons 
from all layers, recording and analyzing their responses. The 
neurons were classified using quantified features of the neuronal 
response to step current pulses, according to the criteria estab- 
lished by the Petilla convention (Ascoli et al., 2008; Figure 4A, 
top), with the exception of stuttering cells, which were consid- 
ered as a separate class (see Druckmann et al., 2013). 
Since no significant bursting behavior was observed in excit- 
atory m-types from animals of the age used in this study, all 
excitatory m-types were classified as continuous adapting 
(cAD) neurons (Figure 4A, bottom). Using this feature-based 



classification scheme, we identified 11 e-types (10 inhibitory 
e-types and 1 excitatory e-type) (Figure 4A; see Experimental 
Procedures). Objective clustering of the same features produced 
a similar classification, validating the original classification 
scheme (Druckmann et al., 201 3). The fact that the e-types iden- 
tified in this way have characteristic ion channel profiles provides 
further evidence for their distinctive identity (Khazen et al., 201 2; 
Toledo-Rodriguez et al., 2004). 

Most inhibitory m-types expressed multiple e-types (Fig- 
ure 4B), consistent with previous observations (Ascoli et al., 
2008; Cauli et al., 2000; Nelson, 2002; Toledo-Rodriguez et al., 
2005). Combining m- and e-types yielded 207 morpho-electrical 
types (me-types), providing an integrated view of the morpho- 
electrical diversity of the microcircuit (Figure 4C). A dataset of 
51 1 morphologically and electrically classified inhibitory neurons 
was used to determine the relative proportion of e-types for each 
inhibitory m-type (in a layer-dependent manner for m-types with 
sufficient samples and otherwise in a layer-independent manner; 
Figure 4C, color map; see Experimental Procedures). The rela- 
tive proportions were combined with neuron densities to calcu- 
late the number of neurons for each me-type in each layer. The 
resulting diversity and spatial distribution of inhibitory e-types 
is illustrated in Figure 5A. This integrated view of the micro- 
circuitry reveals that, at this age, the most common inhibitory 
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e-type is cAC, followed by cNAC and dNAC, and that stuttering 
and irregular e-types (cSTUT, bSTUT, dSTUT, cIR, and bIR) are 
relatively rare (Figure 5B). Inhibitory e-types with regular firing 
patterns (cAC, bAC, cNAC, bNAC, and dNAC) occur more 
frequently in superficial layers, whereas e-types with irregular 
firing patterns (cSTUT, bSTUT, dSTUT, cIR, bIR) are more com- 
mon in deep layers (Figure 5B). 

Digital Reconstruction of Connectivity 

We developed an algorithmic approach to reconstruct synaptic 
connectivity between neurons in a companion study (Reimann 
et al., 2015). The approach is based on five rules of connectivity 
described in the Experimental Procedures and validated in Re- 
imann et al. (2015). We implemented these rules in four stages 
that yield plausible multi-synapse connections, consistent with 
the rules and constrained by experimental bouton densities 
(Figure 6A). 

The algorithm predicts the characteristics of multi-synapse 
connections between pairs of neurons that belong to specific 
m-types (Figure 6B). We have previously shown that these pre- 
dictions faithfully reproduce detailed anatomical data on con- 
nectivity between L5 thick-tufted PCs (number of synapses 
and locations; Ramaswamy et al., 2012) and for a number of 



other connection types (synapse loca- 
tions; Hill et al., 2012). We now show 
that they reproduce the connectivity 
(numbers and locations) of all connection 
types that have been studied experimen- 
tally (see NMC Portal). For example, the 
anatomy of in silico synaptic connections 
between L5 Martinotti cells and L5 thick- 
tufted PCs (Figure S5) compares well with 
available experimental data (Silberberg 
and Markram, 2007). The algorithm pro- 
vides detailed anatomical predictions for 
connection properties, which it has not 
yet been possible to measure experimen- 
tally (e.g., numbers of source and target 
cells and synapses) (Figure S5). The 
reconstruction also allows studies of neu- 
rons involved in polysynaptic pathways 
(see NMC Portal) forming known motifs 
(Honey et al., 2007; Pehn et al., 2011; 
Silberberg, 2008; Sporns and Kotter, 
2004). 

The algorithm yields 1,941 biologically 
plausible multi-synapse connection types 
(out of a theoretical 3,025) that are con- 
sistent with the connectivity principles described above. Figure 7 
shows the predicted average number of synapses formed by 
each potentially viable connection type (Figure 7 A) as well as their 
predicted average connection probabilities (Figure 7B). The 
predicted number of synapses/connection is 4.5 ± 0.1 (3.6 for 
excitatory connections, 13.9 for inhibitory connections; n = 35). 
We also predict 27,625 types of connection between neurons 
of different me-types (see NMC Portal). 

On average, each neuron innervates 255 ± 13 other neurons 
belonging to 32% ± 1% of m-types, forming an average of 
1 ,145 ± 75 synapses per neuron present in the microcircuit (Fig- 
ure S6A; mean ± SD, across the 35 Biol -5 reconstructions; all 
neurons sampled). As a population, the neurons belonging to a 
given m-type innervate 63% ± 6% of the m-types in the microcir- 
cuit. The individual reconstructions (Biol -5) yield an average of 
638 ± 74 million appositions and 36.7 ± 4.2 million synapses 
(27.0 ± 2.9 million excitatory and 9.7 ± 1.5 million inhibitory). 
Taken together, the neurons of the microcircuit form 8.1 ± 0.9 
million connections. Figure 7C and Table SI provide a first 
view of the connectivity between neurons of the neocortical 
microcircuit. Analyzing these data, we find that, at this age, 
the fraction of excitatory synapses (red) increases from LI to 
L6 (Figure S6B). At later ages, this trend may change as axons 
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Figure 6. Reconstructing Connectivity 

(A) Four-step algorithm to convert putative axo-dendritic appositions into functional synapses. (1) Axonal appositions. For an exemplar L23SBC (left, soma and 
dendrites in black, axon in blue), connectivity based on all axo-dendritic appositions (in red) is characterized by an extremely wide distribution of synapses per 
connection and almost 100% connection probability (right, pooled data from efferent connections to L23PCs of n = 100 L23SBCs). (Inset) A selected axon 
collateral with all appositions. (2) After general pruning. For the same exemplar, L23SBC, randomly removing a fraction of appositions removes the right side of the 
distribution of synapses per connection (right). (3) After multi-synapse pruning. Removing connections formed by too few appositions prunes the left side of the 
distribution of synapses (right) but leaves short inter-bouton intervals. (4) After plasticity pruning. The last step randomly removes more connections (right), 
leading to correct inter-bouton-intervals and connection probabilities. 

(B) Examples of in silico multi-synapse connections resulting after the four-step apposition to synapse conversion algorithm. The pre- and postsynaptic m-types 
forming the synaptic connection are indicated. The presynaptic neuron is shown in yellow, postsynaptic neuron in black, and synaptic contacts as red circles. 



mature and reach higher layers. Pooling all excitatory and inhib- 
itory cells in each layer reveals that recurrent excitation increases 
with cortical depth while recurrent inhibition is weak in all layers, 
that descending interlaminar projections are stronger than 



ascending projections, and that intralaminar inhibition is weakest 
in layer 4 (Figure S6C). 

The seven statistical instantiations of the mean microcircuit 
(BioM) yield 636 ± 10 million appositions and 36.5 ± 0.5 million 
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Figure 7. Predicted Synapse Numbers and Connection Probabilities 

(A) Synapses per connection. A matrix of the average synapses per connection for multi-synapse connections formed between the 55 m-types (1 ,941 biologically 
viable connection types). 

(B) Connection probabilities. A matrix of average connection probabilities within 100 lam. 

(C) The connectome of the reconstructed microcircuit grouped by m-type (i.e., 1,941 m-type pathways). Colors group m-types by layer. Thickness of ribbon 
proportional to the number of synapses; inner ring segments, outputs (axons); outer ring segments, inputs (dendrites). 

See also Figure S5 for anatomical details of an exemplary pathway; Figure S6 and Table SI for more details of synaptic innervation strength; and Figure S20 for a 
comparison of the predicted connectome to a recent EM study. See also Movie SIC. 



synapses (25.8 ± 0.4 million excitatory and 10.6 ± 0.2 million 
inhibitory; n = 7; Table SI and Movie SIC). The lower variability 
of the statistical instantiations compared to the individual recon- 
structions (Bio1-Bio5; Table SI) indicates that the variation 
across digital reconstructions falls well within the bounds of bio- 
logical variability. 

From the space remaining on dendrites after accounting for 
predicted intrinsic connectivity (assuming 1.1 synapses/pm; 
Datwani et al., 2002; Kawaguchi et al., 2006; Larkman, 
1991b), we predict that afferent fibers from beyond the 
microcircuit (extrinsic synapses) form a further 147 + 4 million 
synapses (mean ± SD; n = 35) (Figures S6D and S6E). The total 



predicted number of synapses in the microcircuit is thus 184 ± 
6 million (mean ± SD; n = 35), of which only 20% ± 2% of 
synapses are formed by neurons belonging to the microcircuit 
(i.e., intrinsic synapses), consistent with previous estimates 
in neocortex (Stepanyants et al., 2009). In a parallel electron mi- 
croscopy study in which we determined average synapse den- 
sity (0.63 ± 0.1/pm^; mean ± SD; n = 25) and calculated the 
number of synapses in a comparable volume of the neocortex, 
we obtained 182 + 6 million synapses. On the assumption that 
the average number of synapses/connection is the same for 
afferent fibers as for excitatory connections within the microcir- 
cuit (3.6 + 0.04 synapses/connection; n = 35), we predict that 
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the microcircuit contains ~41 million mostly en passant afferent 
fibers. 

The reconstructed microcircuitry reproduces numerous other 
experimental findings that were not used in the reconstruction 
process, described in a companion paper (Reimann et al., 
2015). Nevertheless, it is clear that the predicted connectivity 
is a first draft that will be challenged and refined as experi- 
mental studies discover exceptions to the connectivity rules 
used here. 

Reconstructing Neuronal Physiology 

A series of algorithms and an automated workflow were devel- 
oped to configure NEURON models to reproduce the electro- 
physiology of each me-type, (Druckmann et al., 2007, 2011; 
Hay et al., 2011) (see Experimental Procedures). In brief, we 
selected a morphologically reconstructed neuron and distrib- 
uted Hodgkin-Huxley (HH)-type models of 13 known classes of 
ion channels (Figure S7) along the neuronal arbors (Figure 8A). 
Salient features were extracted from electrophysiological traces 
of e-type responses to step current pulses and data on back- 
propagating action potentials (Figure 8B; Larkum et al., 2001; 
Nevian et al., 2007). A multi-objective optimization algorithm 
(Druckmann et al., 2007) computed the vector of ion channel 
conductance densities that best reproduced features such as 
spike amplitudes and widths, spike frequency, and changes in 
frequency, and the resulting vector was transplanted into all neu- 
rons belonging to the m-type. Neurons in the resulting pool of 
models were challenged with a separate battery of stimuli not 
used to fit the vector of ion channel conductances. We then 
selected those that fell within observed distributions of features 
(~40% of models accepted; Figure 8C). This workflow provided 
a generic high-throughput method for modeling the electrical 
behavior of a potentially unlimited number of neurons of any 
e-type (Figure 8D). We automated the workflow to model all 
207 me-types (Figure 8E), generating a pool of 121,231 unique 
neuron models. Exemplars can be downloaded from the NMC 
portal together with NEURON models of each m-type with all 
of their intrinsic synapses (see Movie S2). Morpho-electrical 
variation in the ensemble of model neurons was comparable to 
the biological variation observed experimentally. The quality of 
the final selection was quantified by comparing model and bio- 
logical neurons in terms of their median z-scores for all electrical 
features (Figure 8E; see Experimental Procedures). 

The generalization power of these models has been demon- 
strated previously (Druckmann et al., 2011). As a further test, 
we compared dendritic attenuation of synaptic potentials in the 
models against past experiments (Berger et al., 2001; Nevian 
et al., 2007). While attenuation along basal dendrites (Figure S8; 
space constant, 40.0 ± 0.1 iim) was consistent with these results 
(Nevian et al., 2007), the reconstruction displayed stronger atten- 
uation along apical dendrites (Figure S8; 174.3 ± 0.4 [irni) than 
previously reported (273 iim; Berger et al., 2001). However, the 
data in the literature were obtained from adult animals whose 
apical dendrites have larger diameters (Zhu, 2000) than those 
of the animals used in this study. In a subset of model neurons 
whose apical dendrites had similar diameters to those of adult 
animals (Zhu, 2000), attenuation was similar (Figure S8, B2, 
dark blue). 



In most cases, transplantation of the vector of conductances 
to variants within the same inhibitory m-type preserved target 
physiology (~80% of models accepted), which was often main- 
tained, even when conductances were transplanted to other 
inhibitory m-types (~60% of models accepted). This suggests 
that, in animals of the age used in the experiments, electrical 
behavior is relatively independent of the specific neuron 
morphologies. 

Reconstructing Synaptic Physiology 

To predict the physiology of the ~36 million synapses in the 
reconstruction, we integrated published paired-recording data 
and reported synaptic properties (conductances, postsynaptic 
potentials [EPSPs/IPSPs], latencies, rise and decay times, fail- 
ures, release probabilities, etc.; see Experimental Procedures 
and NMC portal). 

Neocortical synapses display known forms of short-term dy- 
namics, which we used to classify synaptic connections as facil- 
itating (El and II), depressing (E2 and 12), or pseudo-linear (E3 
and 13) s-types (Figures 9A and 9B) (Beierlein et al., 2003; Reyes 
and Sakmann, 1999; Reyes et al., 1998; Thomson and Lamy, 
2007; Thomson et al., 1996; Wang et al., 2006). The s-types of 
specific connections were determined from the combination of 
their pre- and postsynaptic me-types (AN et al., 2007; Bannister 
and Thomson, 2007; Beierlein and Connors, 2002; Feldmeyer 
et al., 2002; Frick et al., 2007; Gupta et al., 2000; Markram 
et al., 1998; Reyes et al., 1998; Somogyi et al., 1998; Thomson 
et al., 1993). Based on the available experimental data, we iden- 
tified five rules to predict s-types for broad classes of connec- 
tions: (1) pyramidal-to-pyramidal connections are always 
depressing (E2) (Feldmeyer et al., 1999; Frick et al., 2007, 
2008; Gupta et al., 2000; Maffei et al., 2004; Markram et al., 
1998; Mason et al., 1991; Mercer et al., 2005; Reyes et al., 
1998; Thomson and Bannister, 1998; Thomson et al., 1993), (2) 
pyramidal-to-interneuron connections are also depressing (E2) 
(Angulo et al., 1999; Blatow et al., 2003; Holmgren et al., 2003; 
Markram et al., 1998; Reyes et al., 1998; Silberberg and Mark- 
ram, 2007; Thomson and Deuchars, 1997; Wang et al., 2002), 
except for connections onto Martinotti, bitufted and other inter- 
neuron types displaying spike frequency accommodation, which 
are facilitating (El) (Kapfer et al., 2007; Markram et al., 1998; 
Reyes et al., 1998; Rozov et al., 2001 ; Silberberg and Markram, 
2007), (3) facilitation from inhibitory neurons is around two times 
stronger than from excitatory neurons (Gupta et al., 2000; Silber- 
berg and Markram, 2007), (4) synaptic dynamics are preserved 
across layers for all me-type-specific connections, and (5) any 
remaining connections belong to the most common s-type 
(type 2; E2 or 12). 

Since physiological characterization of all 27,625 unique me- 
type-to-me-type connections is not feasible, s-types in which 
experimental data were missing were specified using the rules 
above. Parameters for the synaptic dynamics of individual 
synapses were drawn from experimental distributions. In this 
manner, we generated a complete, albeit sparsely character- 
ized, map of synaptic dynamics (Figure 9C). Stochasticity of 
synaptic transmission was modeled by extending a previously 
reported model (Fuhrmann et al., 2002). As an independent vali- 
dation of the modeled synaptic dynamics, we compared the 
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Figure 8. Workflow for High-Throughput Reconstruction of Morpho-Electrical Behaviors 

(A) Morphology and ion channel models. Selection of exemplar morphology, ion channel models, and their distribution on soma, dendrites, and axon. 

(B) Feature extraction. Selection of experimental traces from a population of recorded cells as targets for fitting. Extraction of voltage and spiking features from 
experiments. 

(C) Quality assurance. Multi-objective optimization of the vector of ion channel conductance densities to match the statistics of the extracted biological features in 
the model. Screen out models with electrical features that do not match the statistics for equivalent features in biological recordings. 

(D) Models of e-types. Shows the 1 1 e-types modeled. 

(E) Generalization and model quality. Generalization of the vector of ion channel conductance densities to other exemplars of the same m-type; application of a 
standardized set of measurement protocols to each model neuron to determine generalization; quality scores for accepted models (median z-score). 

See also Figure S7 for properties of modeled ion channels and Figure S8 for dendritic properties. 



coefficient of variation (c.v.) of first PSPs against reported exper- 
imental data (r = 0.8; Figure 9D; Gupta et al., 2000; Markram 
et al., 1998; Wang et al., 2006). 

We then applied unitary synaptic conductances obtained in 
previous experiments that also measured somatic postsynaptic 
potentials (PSP) between specific pairs of m-types and 
compared the resulting in silico PSPs with the corresponding 



in vitro PSPs (Figure 1 0A). The in silico PSPs were systematically 
lower. Since the neuron models and the numbers and locations 
of synapses between pairs of m-types had been validated, we 
hypothesized that the reported synaptic conductances had 
been underestimated, because of inadequate compensation 
for space-clamp errors (Feldmeyer et al., 2002; Gupta et al., 
2000; Rinaldi et al., 2008). To quantify the underestimate. 
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Figure 9. Reconstructing Dynamic Synaptic Transmission. 

(A) In silico synaptic connection. Experimental protocol recreated in silico to obtain the frequency dependence of synaptic transmission between pairs of neurons. 
A presynaptic L4NBC (black pipette) was stimulated with a 30 Hz pulse train to evoke eight APs + 1 “recovery” AP (bottom trace), resulting in inhibitory depressing 
responses (top traces) in the postsynaptic L4SS (blue pipette); 30 individual trials in gray, average in blue. The connection was mediated by 12 synaptic contacts 
(blue stars). 

(B) Synapse types (s-types). Parameters describing six s-types in the Tsodyks-Markram phenomenological synapse model (see Experimental Procedures). 

(C) Map of predicted synaptic dynamics. Previously established mapping rules were used to constrain s-types for connections that have not yet been char- 
acterized experimentally (see Figure S8), yielding a complete map for all 1 ,941 m-type-to-m-type connections in the reconstructed microcircuit. 

(D) Validation. Trial-to-trial variability for different s-types in silico compared to in vitro data. Dots and error bars show mean ± SD of the data; dashed line shows 
regression fit. 



synaptic conductances were adjusted until in silico PSPs 
matched experimental levels (Figure 10B and Table S2; Angulo 
et al., 1 999; Le Be et al., 2007; Feldmeyer et al., 2006; Feldmeyer 
et al., 1 999, 2002; Markram et al., 1 997; Silberberg and Markram, 



2007). The results suggested that reported conductances are 
about 3-fold too low for excitatory connections, and 2-fold too 
low for inhibitory connections (Table S2; Gupta et al., 2000; Ri- 
naldi et al., 2008). Other recent studies also suggest that 
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Figure 10. Reconstructing Quantal Synaptic Conductances. 

(A) Unitary synaptic responses. A single AP was evoked in a presynaptic L1 MAC (black pipette and trace). The postsynaptic potential was recorded at the soma of 
a L23PC (blue pipette; 30 individual trials in gray, average in blue); synaptic conductance was recorded simultaneously in the dendrite (green pipette and trace). 

(B) Validation. Comparison of in silico PSP amplitudes to in vitro characterized connections (n = 9; mean ± SD; Table S2), explicitly correcting reported con- 
ductances for space-clamp errors (see Experimental Procedures). Dots and error bars show mean ± SD of the data; dashed line shows regression fit. 

(C) Validation. As B for connections that lack conductance estimates (n = 10; mean ± SD; Table S4). Conductances were generalized from B for broad classes of 
excitatory and inhibitory connections (see Experimental Procedures). 

(D) Quantal synaptic conductances. In the absence of experimental data for postsynaptic potentials, synaptic conductances were generalized from data for 
similar connections, allowing the prediction of quantal synaptic conductances for all synapses on a neuron. Simultaneous recording of quantal synaptic con- 
ductances in a L5TTPC are shown in colored traces (excitatory, red to pink; inhibitory, green to blue). 

(E) Predicted map of quantal conductances. Circles indicate connections used in B (black) and C (white) above. Black lines separate excitatory m-types. 

See also Figure S9 for examples of in silico synaptic patch and staining experiments and Table S2 for corrected conductances. See also Movie S2. 
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previously reported values are underestimated to a similar de- 
gree (Sarid et al., 2007; Williams and Mitchell, 2008). 

For the vast majority of connection types, no experimental 
data for synaptic conductances were available. Therefore, we 
computed the average corrected synaptic conductances for 
broader classes of synaptic connections (e.g., E-E, E-l, l-l, l-E; 
see Experimental Procedures) and applied these conductances 
to all specific connections where data were missing. The result- 
ing amplitudes of in silico PSPs were validated against experi- 
mental data for ten connection types not used in determining 
the conductances (regression, r = 0.6; Figure 10C and Table 
S2). The derived synaptic dynamics and quantal conductances 
compared well with previous reports (Feldmeyer et al., 2002; 
Ramaswamy et al., 2012; Silberberg and Markram, 2007; Thom- 
son and Deuchars, 1997) (see NMC Portal). Using the same 
method, we generated a first prediction of mean synaptic con- 
ductances for all 1,941 m-type-to-m-type connections (Fig- 
ure 10D). Unique quantal synaptic conductances for individual 
synapses were drawn from truncated normal distributions 
around these means (Figure 1 0E; see Experimental Procedures). 

We performed in silico paired recordings of all 1,941 m-type- 
to-m-type connections in the average microcircuit (BioM) and 
found results comparable to previously published paired record- 
ings in vitro (see Figures S9A-S9J for an example; see NMC 
Portal). Obtaining the anatomical and physiological properties 
of all the intrinsic synapses formed onto and by any neuron 
has long been an experimental challenge (Crick, 1979). The 
reconstruction now allows in silico retrograde staining experi- 
ments for any neuron in the microcircuit, providing a detailed 
view of its presynaptic neurons and their synapses (Figure S9K). 
In silico anterograde staining for postsynaptic neurons is also 
possible. Figure S9L illustrates predicted locations of afferent 
synapses formed onto a L23_PC. Figure S9M shows the mean 
number of presynaptic (red) and postsynaptic (blue) neurons 
for excitatory (top) and inhibitory (bottom) m-types. Predicted 
input-output synapses for all 31,346 neurons in the BioM 
microcircuit and summary statistics for each of the 55 m-types, 
11 e-types, and 207 me-types can be downloaded from the 
NMC Portal. The portal also provides NEURON models of each 
m-type, allowing simulation experiments exploring dendritic 
integration of m-type-specific synaptic inputs. 

We found that the m-, e-, and s-types of inputs to any partic- 
ular neuron were always strikingly different from those of its 
outputs (i.e., inputs and outputs were highly asymmetrical; see 
NMC Portal). The predicted average total synaptic conductance 
for single neurons was ~1000 nS (~750 nS excitatory and 
~250 nS inhibitory conductance; based on all synapses in 
BioM). Predicted average quantal conductance was 0.85 ± 
0.44 nS for excitatory synapses (corresponding to ~150 AMPA 
and ~20 NMDA receptors; Yoshimura et al., 1999) and 0.84 ± 
0.29 nS for inhibitory synapses (corresponding to ~40 GABAa 
receptors; Ling and Benardo, 1999). The average failure rate 
across all 1,941 m-type-to-m-type connections was 11.1% ± 
14.1%. 

Simulating Spontaneous Activity 

To simulate reconstructed microcircuits at the level of detail 
described above, the NEURON simulator was extended to run 
on supercomputers (Figure S10; Carnevale and Flines, 2006; 



Hines and Carnevale, 1997; Hines et al., 2008a, 201 1 , 201 1 ; Mi- 
gliore et al., 2006), and additional functionality was developed to 
support in silico experimentation (see Experimental Procedures). 
We then used simulations to investigate the neuronal activity of 
the reconstructed microcircuitry (Figure 11 A) under different 
conditions. We began by simulating spontaneous activity during 
tonic depolarization (see Movie S3A), attempting to mimic previ- 
ous in vitro experiments (see Experimental Procedures). Under 
these conditions, neurons belonging to all m-types were active 
and the network exhibited spontaneous slow oscillatory popula- 
tion bursts, initiated in L5, spreading down to L6, and then up to 
L4 and L2/3 with secondary bursts spreading back to L6 (Fig- 
ure 11B). Despite apparent global synchrony, the 55 m-types 
generated diverse patterns of spiking (Figures 1 1C and 1 1 D). 

To allow comparison with the in vitro experiments, from which 
the physiological data were obtained, we reconstructed a virtual 
brain slice (a mesocircuit) that was 230 iim thick and whose 
width was equivalent to that of seven microcircuits (containing 
a total of 139,834 neurons) (Figure 12A; see Movie S3B and 
Experimental Procedures). The virtual slice reproduced the 
oscillatory bursts (~1 Hz) found in the previous microcircuit 
simulations (Figure 12B), which are comparable to those found 
in in vitro experiments (Lorincz et al., 2015; Sanchez-Vives and 
McCormick, 2000). 

In vitro experiments are typically performed at 2 mM [Ca^^o> 
while the level of [Ca^^o in vivo is reported to lie in the range 
0.9-1 .1 mM (Amzica et al., 2002; Jones and Keep, 1988; Massi- 
mini and Amzica, 2001; also see Borst, 2010), increasing in 
oscillatory cycles to 1 .2-1 .3 mM during the transition from wake- 
fulness to sleep (Amzica et al., 2002; Heinemann et al., 1977). 
Although it is not possible to fully mimic in-vivo-like conditions, 
we nonetheless explored the behavior of the circuit at these 
lower Ca^'^ levels as an approximation of the in vivo condition. 

It is well known that the [Ca^^o in the extracellular space mod- 
ulates the probability of neurotransmitter release (Borst, 2010; 
Chana and Sakmann, 1998; Rozov et al., 2001). We therefore 
modified the probability of release, consistent with experimental 
data for the specific sensitivities of different s-types to changes 
in [Ca^^o (Gupta et al., 2000; Rozov et al., 2001; Silver et al., 
2003; Tsodyks and Markram, 1997) (Figure S11; see Experi- 
mental Procedures). We found that, in the low Ca^"^ condition, 
slow oscillatory bursting disappeared and the neuronal activity 
became asynchronous and irregular (Figure 12C). To validate 
this in silico finding, we performed multi-electrode array record- 
ings in vitro (Figure 12D) in high and low Ca^"^ conditions (see 
Experimental Procedures). As predicted by the simulations, 
we found that the slow oscillatory bursts present in high Ca^'^ 
(Figure 12E) were replaced by asynchronous and irregular activ- 
ity under low Ca^^ conditions (Figure 12F). 

We then used the virtual slice to explore the behavior of the 
microcircuitry for a wide range of tonic depolarization and Ca^"^ 
levels. We found a spectrum of network states ranging from 
one extreme, where neuronal activity was largely synchronous, 
to another, where it was largely asynchronous (the synchro- 
nous-asynchronous [SA] spectrum; Figure SI 2). The spectrum 
was observed in virtual slices, constructed from all 35 individual 
instantiations of the microcircuit (seven for each of Bio1-Bio5) 
and all seven instantiations of the average microcircuit (BioM). 
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Figure 11. Simulation of the Reconstructed Microcircuit 

(A) Simulation of spontaneous activity. Individual neurons at different levels of depolarization in the microcircuit are colored according to a heatmap (blue, 
hyperpolarized; red, depolarized; white, spike). 

(B) Rastergrams of randomly selected neurons for each m-type during synchronous bursting. 

(C) Exemplar voltage traces for each of the 55 m-types during spontaneous activity in the microcircuit (traces truncated at -30 mV). 

(D) Inter-spike interval (ISI) distributions of each of the 55 m-types for the activity shown in C. 

See also Figure S10 for an overview of the software ecosystem surrounding the simulation of the microcircuit. See also Movie S3A. 



This implies that it is a highly reproducible phenomenon, robust 
to biological and statistical variations in parameters such as 
layer thickness, cell density, and composition; specific synaptic 
connectivity; and the specific dimensions of the microcircuit 
(see Movie S3C). 

We observed that a change in [Ca^^o of < 1 mM can lead to a 
transition from the synchronous to the asynchronous state, 
revealing two distinct activity regimes (Figure SI 2). The level of 
[Ca^^o at the transition varied slightly across the different instan- 
tiations of the microcircuit (Bio1-Bio5; Figure SI 3). 

Since the reconstructed microcircuitry displays synaptically 
coupled assemblies comparable to those found experimentally 
(Pehn et al., 201 1 ; Reimann et al., 201 5), we also analyzed corre- 
lations in neuronal activity within these assemblies. Neuronal 
activity was found to be slightly more correlated within assem- 



blies compared to randomly sampled neurons (Figure SI 4). 
Near the transition, a fall in [Ca^^o of just ~0.15mM (Figure SI 4) 
led to a sharp decrease in correlated spiking, clearly demar- 
cating a transition in the SA spectrum. 

The mechanism underlying this sharp transition is likely to 
involve the differential Ca^"^ sensitivities of inhibitory and excit- 
atory synapse types. Indeed, we found that changing [Ca^^o 
from 2 mM to 1.3 mM alters the ratio between excitatory and 
inhibitory synaptic PSPs by a factor of ~3.5, in favor of inhibition 
(Figure S1 1). This suggests the existence of a threshold level of 
Ca^"^ beyond which inhibition is insufficient to prevent a super- 
critical state (see below). 

The finding that differential sensitivity of s-types to Ca^^ 
levels determines the position of the network along the SA 
spectrum suggests that other mechanisms that change the 
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Figure 12. Predicting and Validating Synchronous and Asynchronous States in Spontaneous Activity 

(A) A spontaneously active virtual slice formed from seven unitary microcircuits (230.9 x 2800 x 2082 |im). 

(B) Rastergram of a random selection of neurons during in silico spontaneous activity under in-vitro-like conditions (somatic depolarization to ^90% threshold, 
([Ca^^o = 2.0 mM). Number of neurons displayed per layer is proportional to the total number of neurons per layer. 

(C) Rastergram of a random selection of neurons during in silico spontaneous activity under in-vivo-like conditions (somatic depolarization to -^90% threshold, 
([Ca^1o=1-0 mM). 

(D) To assess network activity, 300-|^m-thick cortical slices were mounted on a 3D multi-electrode array (MEA) (reconstruction of a layer 5 pyramidal cell overlaid). 

(E) Experimentally observed spontaneous multi-unit activity under in vitro [Ca^^o- 

(F) Experimentally observed spontaneous multi-unit activity under in vivo-like [Ca^^o- 

See also Figure S1 1 for [Ca^^o sensitivity of synapse types; Figures SI 2 and SI 3 for activity along the full spectrum of [Ca^^o concentrations and its biological 
variability; Figure SI 4 for synchrony in synaptically clustered neurons along the spectrum; Figure SI 5 for the effect of selective knockouts on microcircuit activity. 
See Movie S3B for a visualization of B and Movie S3C for C. 



excitatory-inhibitory balance may have similar effects. We there- 
fore performed in silico knockout experiments to understand the 
roles of the different layers, neurons, and connections in control- 



ling the position of the microcircuit on the spectrum (Figure S1 5). 
We found that blocking activity in the upper layers tended to shift 
the network toward the synchronous state, while blocking the 
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deeper layers had the opposite effect (Figure S15A). Similarly, 
blocking soma-targeting basket cells produced a stronger shift 
toward the synchronous state than blocking other interneurons, 
while blocking pyramidal cells caused a shift toward the asyn- 
chronous state (Figure S15B). Corresponding differential effects 
were found when blocking associated inhibitory and excitatory 
connections (Figure S15C). These effects were observed both 
at high and low [Ca^^o- It follows that differential regulation of 
layers, neurons, and connections plays an important role in con- 
trolling the position of the microcircuit along the SA spectrum, 
independently of [Ca^^o- 

Simulating Thaiamic Activation of the Microcircuit 

To examine spatio-temporal patterns of evoked activity, we 
constructed a mesocircuit consisting of a central microcircuit 
surrounded by six additional microcircuits. Connectivity was 
established for the mesocircuit as a whole, with no anatomical 
borders between microcircuits. An algorithm was developed to 
approximate input from the thalamus to the central microcircuit 
in such a way as to satisfy experimental constraints. We used 
data for the number of incoming fibers, bouton density profiles, 
and the numbers of synapses per connection (to layer 4) for 
the ventral posteromedial (VPM) thalamic input to the barrel re- 
gion of somatosensory cortex (Constantinople and Bruno, 
201 3; Gil et al., 1 999; Meyer et al., 201 Ob). To represent the num- 
ber of fibers, we instantiated one fiber centered in each minicol- 
umn with a horizontal spread (Meyer et al., 201 Ob). We then used 
the layer-by-layer bouton density profiles (Meyer et al., 2010b) 
(Figure 13A, left), experimental measurements of the mean num- 
ber of synapses per thalamic connection in layer 4 (Amitai, 2001 ; 
Gil et al., 1999), and the multi-synapse principle (see above and 
Experimental Procedures) to predict the synapses that each 
thalamic fiber forms onto different m-types (Figure 13A, right). 
The reconstruction reproduced the number of synapses formed 
on L4PCs (Figure 13B) (Amitai, 2001; Gil et al., 1999) and pre- 
dicted, for example, an average of ~1 2 synapses on L5 pyrami- 
dal neurons (Figure 13B), more than for L4PCs. Overall, we pre- 
dicted that each thalamic fiber innervates 903 ± 66 neurons 
(mean ± SD; n = 100 fibers; Figure 13C; 775 ± 57 excitatory 
and 83 + 11 inhibitory neurons) with an average of 8.1 ± 4.2 syn- 
apses/connection. In total, we found that thalamic fibers form ~2 
million synapses in the central microcircuit (~1 % of synapses 
across all layers; see Meyer et al., 2010b). 

Thalamocortical synaptic transmission was modeled using 
in vitro data on synaptic dynamics (Figure 13D, left; Amitai, 
2001; Gil et al., 1999) and the generalized excitatory-to-excit- 
atory conductances derived above (i.e., similar to L4_EXC, E2 
s-type; see Experimental Procedures). Ca^"^ dependency was 
modeled as for other excitatory connections. The resulting syn- 
aptic transmission was validated by comparing in silico PSPs in 
L4 and L5 PCs in low-Ca^'^ conditions against previous in vivo re- 
ports (Bruno and Sakmann, 2006; Constantinople and Bruno, 
2013; Figure 13E). Distributions of PSPs in L4 and L5 PCs in 
high-Ca^'^ conditions were also predicted (Figure 13D, right). 

With [Ca^^o at 1 .25 mM and moderate depolarization (in-vivo- 
like conditions), the main response to stimulation of thalamic 
fibers was in L4 to L6 (Figure S16A). Examination of the 
spiking activity of a random selection of neurons, covering all 
55 m-types, showed that most m-types in these layers re- 



sponded to the stimulus (Figure SI 6B). To investigate the effects 
of a graded stimulus, we used a single synchronous spike to 
activate a progressively increasing number of fibers innervating 
the center of the mesocircuit. With [Ca^^o at 2.0 mM and zero 
depolarization (in-vitro-like conditions), activating four or more 
fibers evoked a stereotypical high-amplitude PSTFI response 
(>80 Hz, Figure 14A), similar to previous in vitro observations 
(Beierlein et al., 2002). In contrast, stimulation under in-vivo- 
like conditions produced graded responses, with 20-30 Hz oscil- 
lations emerging in the lower layers, particularly in L6, when 
higher numbers of fibers were stimulated (Figure 14B). While 
under in-vitro-like conditions, stimulating as few as four thalamic 
fibers produced all-or-none behavior, indicative of a regenera- 
tive state that spread across the whole mesocircuit (Figure 1 4C), 
under in-vivo-like conditions, the activity remained localized 
(Figure 14D). 

With increasing Ca^'^ levels, the stimulus response curves 
measured during the first 10 ms of thalamic stimulation shifted 
from a linear to a sharp sigmoidal shape (Figure 14E). Analysis 
of the velocity of spread revealed a qualitative difference be- 
tween the synchronous and asynchronous regimes. In the syn- 
chronous regime, the spread of activity accelerated over time, 
while in the asynchronous regime, it was constant until the ampli- 
tude of the activity fell to zero (Figure 14F). This suggests that, 
in the synchronous regime, inhibition cannot act fast enough to 
curb the excitation and prevent uncontrolled spreading activity. 
Correlated activity is maximal in the regenerative regime and 
minimal in the non-regenerative regime (Figure SI 4), suggesting 
a dynamic range for correlations to emerge during information 
processing under in-vivo-like conditions. Taken together, the 
simulations predict that, at the average [Ca^^o reported in the 
awake state (Jones and Keep, 1988; Massimini and Amzica, 
2001; Westerink et al., 1988), the neocortex will exhibit graded 
and spatially restricted activation, a prerequisite for the emer- 
gence of functional maps with high spatial resolution. 

At a level of tonic depolarization where the network is spontane- 
ously active in both the regenerative and non-regenerative 
regimes, we observed a spectrum of oscillations with lower fre- 
quencies (~1 Hz) in the regenerative regime and higher fre- 
quencies in the non-regenerative regime (~10-20 Hz; data not 
shown). Maximum power was observed in layers 5-6 in the regen- 
erative regime and in layers 2-3 in the non-regenerative regime 
(data not shown). This suggests that shifts along the SA spectrum 
contribute to the spectra of oscillatory frequencies observed 
in spontaneously active neocortex (see also Tan et al., 2014). 

To establish a more complete demarcation between different 
activity regimes, we performed a series of simulations system- 
atically exploring network state at varying levels of [Ca^^o and 
depolarization. An analysis of average firing rates demarcated 
the boundary between evoked and spontaneous activity (Fig- 
ure 15A). The boundary between spontaneous regenerative 
and non-regenerative regimes was demarcated by the pres- 
ence or absence of spontaneous bursting activity (Figure 15B). 
The transition between the evoked regenerative and non- 
regenerative regimes was determined by an analysis of the 
amplitude of the response to stimuli at the edge of the meso- 
circuit (Figure 15C). The combination of these activity maps 
demarcates four distinct activity regimes: evoked regenerative 
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Figure 13. Reconstructing Thalamocortical Input 

(A) Bouton and synapse profiles. (Left) The distribution of boutons across the depth of the microcircuit (orange line, Meyer et al., 2010b) assuming one afferent 
fiber from ventral posteromedial (VPM) thalamic nucleus per minicolumn, overlaid on randomly chosen neurons. (Right) The resulting synapses formed (green) 
with synapses formed by a single fiber (red). 

(B) Distribution of the number of synapses per connection formed by the population of thalamic fibers onto L4 and L5 excitatory neurons. L4 distribution is 
compared against in vitro data (dashed red line; Gil et al., 1999). Horizontal bar: mean ± SD. 

(C) Postsynaptic neurons. Distribution of the number of postsynaptic neurons innervated by individual thalamic fibers. 

(D) In-vitro-like conditions. (Left) Synaptic dynamics of thalamocortical connections to L4 excitatory cells (gray, 30 trials; blue, average; [Ca^^o = 2.0 mM). (Right) 
Distribution of PSP amplitudes of thalamocortical connections to L4 and L5 excitatory cells ([Ca^'^]o = 2.0 mM, horizontal bar, mean ± SD). 

(E) In vivo-like conditions. (Left) Synaptic dynamics of thalamocortical connections to L4 excitatory cells (gray, 30 trials; blue, average; [Ca^^o =1-3 mM). (Right) 
Distribution of PSP amplitudes of thalamocortical connections to L4 and L5 excitatory cells ([Ca^^o = 1 -3 mM, horizontal bar, mean ± SD). L4 and L5 distributions 
are compared against in vivo data (L4, dashed red line, left; Bruno and Sakmann, 2006; L5, star dashed red line; Constantinople and Bruno, 2011). 

See also Figure S1 1 for [Ca^^o sensitivity of synapse types. 



(ER), spontaneous regenerative (SR), evoked non-regenerative 
(EN), and spontaneous non-regenerative (SN) (Figure 15D). 

Reproducibility of Emergent Properties 

The reconstructed microcircuitry is based on biological data 
from a large number of different animals and, in some cases. 



from different neocortical regions that together provide statis- 
tical distributions for layer heights, neuron densities, cellular 
composition, and morphological and electrophysiological diver- 
sity within and across types of neuron and reflect the diversity 
of synaptic anatomy and physiology observed in biological 
experiments. The reconstruction process stochastically creates 
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instantiations of the digital microcircuit that respect these distri- 
butions. We have previously shown that detailed synaptic phys- 
iology is largely invariant across different instantiations of the 
digital microcircuit (Ramaswamy et al., 2012) and that emergent 
parameters such as the distributions of the locations of synapses 
formed by different presynaptic neurons are also largely invariant 
(Hill et al., 2012). 

To further assess the reproducibility of the reconstruction as a 
whole, we measured the variance of a range of its emergent 
anatomical and physiological properties (i.e., properties not 
directly specified by the data). The anatomical properties 
measured from seven instantiations of each microcircuit (seven 
instantiations of BioM and seven each for Bio1-Bio5) included 
total number of appositions and synapses, convergence and 
divergence of connectivity for each m-type, numbers of excit- 
atory and inhibitory synapses and connections, mean numbers 
and types of presynaptic neurons innervating neurons belonging 



to different m-types, and numbers of 
intra-and inter-laminar synapses and 
connections. In each case, we found low 
variance compared to the mean (see Fig- 
ure SI 7 for a sample; see Table SI for selected values; see also 
NMC Portal). 

To gain a deeper understanding of the physiological variability 
of the digital reconstruction, we examined trial-to-trial variability 
in the spiking activity of individual neurons and variability across 
neurons of the same type, as well as variability across layers and 
across digital reconstructions individualized with data from five 
different animals (Biol -Bio5). Cell responses to a single thalamic 
stimulation, roughly comparable to a single whisker deflection 
(stimulation of a cluster of 60 minicolumns), displayed varying 
degrees of trial-to-trial variability (Figure 16A). Since the digital 
reconstruction implements biologically grounded stochastic 
mechanisms for synaptic transmission, spontaneous release, 
and some ion channels, this was expected. However, each 
cell-type also displayed a characteristic delay to first spike 
response. In some cases, the distribution of single-neuron re- 
sponses was similar to that of the population (Figure 16B, left). 



Figure 1 4. Activity Evoked by Thalamic Input 

(A) In-vitro-like stimulus-dependent evoked activ- 
ity. Raster plots (top) and PSTHs (bottom) of the 
response to stimulation of varying number of 
thalamic fibers under in vitro-like conditions (so- 
matic depolarization to -^60% threshold, [Ca^^o = 
2.0 mM). 

(B) In-vivo-like stimulus-dependent evoked activ- 
ity. Same as in A for in-vivo-like conditions (somatic 
depolarization to -^100% threshold, [Ca^^o = 
1.25 mM). Stimulation times and number of fibers 
stimulated are shown below. 

(C) In-vitro-like activity propagation. A mesocircuit 
under in vitro-like conditions (no somatic depolar- 
ization, [Ca^^o = 2.0 mM) stimulated with single 
synchronous spikes to each of 16 thalamic 
fibers at the center of the central microcircuit. The 
mesocircuit at L4 is depicted from above at 
different times after stimulation. Neuronal somata 
are rendered with a heat color map indicating level 
of depolarization. 

(D) In-vivo-like activity propagation. Same as in 
D but under in vivo-like conditions (somatic depo- 
larization to -^100% threshold, [Ca^^o = 1 -25 mM). 

(E) Stimulus response curves for various levels 
of Ca^^ and somatic depolarization to -^85% 
threshold. Response amplitude determined as the 
peak response for the central ten minicolumns in 
the first 10 ms of the response. 

(F) Propagation of the wave front with time in 
response to thalamic stimulation (at t = 0), 
measured as the half-maximum of a one-sided 
Gaussian fit to the wave front. Exponential fit for 
the regenerative activity and linear fit for non- 
regenerative activity (conditions as in C and D, 
respectively). In the non-regenerative regime, the 
amplitude of the wave front was zero at 25 ms. 
See also Figure SI 6 for stimulus responses of 
individual m-types. 
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Figure 15. The Regime Map 

Characterization of spontaneous and evoked activity under different ieveis of 
depoiarization and [Ca^'^]o. 

(A) Average spontaneous firing rates. White iine indicates interpoiated transi- 
tion between evoked and spontaneous regimes. 

(B) Presence of spontaneous bursting activity. White iine indicates transition 
between spontaneous regenerative (SR) and non-regenerative (NR) activity. 

(C) Map of evoked regenerative (ER) and non-regenerative activity determined 
by the ampiitude of the response to stimuii at the edge of the mesocircuit in 
reiation to the initiai response at the center. White iine indicates the transition 
between the ER and NR regimes. Biue-green region is extrapoiated to be NR. 

(D) Schematic map of showing the four activity regimes. Evoked regenerative, 
ER; spontaneous regenerative, SR; evoked non-regenerative, EN; sponta- 
neous non-regenerative, SN. 



while in others it was markedly different (Figure 16B, right). For 
both excitatory and inhibitory neurons, variance in response 
times decreased with cortical depth (Figure 16C). In all layers, 
trial-to-trial variability was lower than the variability between indi- 
vidual neurons of the same type in single trials. 

Responses from neurons of the same m-type, in digital recon- 
structions based on data from individual animals (Bio1-Bio5), 
displayed higher variability across reconstructions than in 
different instantiations of the average microcircuit (BioM) (for 
an example, see Figure 16D, left). To isolate the source of this 
inter-individual variability, we began by re-examining the SA 
spectra for the reconstructions at different levels of Ca^'^. We 
found that they all displayed the spectrum but that the precise 
level of Ca^"^ at the transition between the synchronous and 
asynchronous state was slightly different for each reconstruc- 
tion, ranging from 1.23 to 1.31 mM [Ca^^o (Figure SI 3). We 
therefore repeated the simulations, setting the Ca^'^ level such 
that each reconstruction was shifted to the same point along 
the spectrum relative to the transition. Under these normalized 
conditions, the variance in the responses of specific m-types 
across reconstructions decreased strikingly (Figure 16D, right). 
Figure 16E summarizes the different sources of variability for 
all neurons in L4 and L5. 

Taken together, these results demonstrate the ability of the 
digital reconstruction to accommodate physiological variability 
while maintaining reproducibility and are evidence of its potential 
to generate useful biological insights. To further test this poten- 
tial, we attempted to replicate results from an array of recent 
in vivo studies. 




Reproducing In Vivo Findings 

The digital reconstructions described above aimed to recreate 
the anatomy and physiology of an isolated slice of neocortical 
tissue, but not specifically to replicate any particular in vivo 
experiment. Nonetheless, we tested the ability of the digital 
reconstruction to replicate such experiments. We selected a 
set of recent in vivo studies in which a reasonable replication 
of the stimulation and analysis protocols was technically 
feasible. We then selected an arbitrary instantiation of BioM 
and used this model for all tests. In each case, we maintained 
the model’s original parameters, without introducing modifica- 
tions to fit previously reported results— a “zero tweak” strategy. 
All simulations were performed near the transition from the syn- 
chronous to the asynchronous state. 
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Figure 16. Reliability of Microcircuit Re- 
sponses 

(A) Raster plot of the spiking activity of an exem- 
piary L23PC (top) and an exempiary L4PC (bottom) 
in response to simuiated thaiamocorticai stimuia- 
tion with 60 fibers. The first spike after the stimuius 
in each of 200 triais is indicated in red, other spikes 
in biack. The biue iine indicates the probabiiity that 
the neuron fires a spike in a 5 ms bin. 

(B) Histograms of the response deiay (deiay of the 
first spike after stimuius presentation) for L23PCs 
(ieft) and L4PCs (right). (Biack) For 200 triais of 25 
randomiy chosen neurons of the indicated type. 
(Red) For 200 triais of the neurons indicated in A. 

(C) Standard deviation of the response deiay of 
neurons in different iayers across triais. Red iine 
indicates the median of neurons, biue boxes the 
25^*^ and 75^*^ percentiies, and whiskers the fuii data 
spread. (Top) Excitatory neurons; (bottom) inhibi- 
tory neurons. 

(D) Standard deviation of the response deiay 
across triais of neurons when piaced in microcir- 
cuits constructed from different bioiogicai data- 
sets (Bio1-Bio5). (Top) Of five L4SPs. (Bottom) Of 
five LGSBCs. (Left) Linder simulated extracellular 
Ca^^ concentration of 1.25 mM. (Right) When the 
caicium concentration was set to a vaiue on the 
border between regenerative and non-regenera- 
tive activity for that particuiar microcircuit. Boxes 
and whiskers as in 0. 

(E) Comparison of the different sources of vari- 
abiiity. (Left to right) Inter-triai variabiiity (same 
neuron in same microcircuit across triais); neuronai 
variabiiity (same triai in same microcircuit across 
neurons of a given m-type); inter-circuit variabiiity 
(same neuron in different microcircuit); inter-circuit 
variabiiity under normaiized Ca^'^ concentrations. 
(Top) in iayer iV; (bottom) in iayer V. Boxes and 
whiskers as in 0. 

See aiso Figure SI 7 for the anatomicai variabiiity of 
microcircuits. 



Neuronal Responses to Single-Whisker Deflection 

Many in vivo studies of evoked neuronal activity have reported 
that basal activity is sparse in all cell types, that the response 
characteristics of individually recorded neurons display cell- 
type-specific diversity, and that response latencies are cell 
type and layer specific (Constantinople and Bruno, 2013; 
Reyes-Puerta et al., 2015). To test the ability of the reconstruc- 
tion to reproduce these findings, we attempted to replicate 
some of the experiments reported in a recent study by Reyes- 
Puerta et al. (2015), in which the authors recorded and analyzed 
neuronal responses to a single-whisker deflection in the 
barrel cortex of anaesthetized adult rats. We approximated the 
stimulus as a single pulse in 60 reconstructed thalamic fibers 
projecting to the center of the digital microcircuit. As shown in 
Figure 17A1 , the response to the stimulation displays cell-type- 
specific diversity that compares reasonably well with the results 
reported in Figure 3A of the Reyes-Puerta et al. study (Reyes- 
Puerta et al., 2015), with the exception of the OFF response, 
which is not as prominent. The general distribution of responses 
for excitatory and inhibitory cells is also comparable (Figure 1 7A2 
in silico versus Figure 3B in vivo), though again with fewer OFF 



responses. As in Reyes-Puerta et al. (Reyes-Puerta et al., 
201 5), most responses occurred within 1 0-20 ms of the stimulus 
and were generally led by inhibitory cells and, more specifically, 
by inhibitory cells in L4 and L5 (Figure 17A3 in silico versus Fig- 
ure 4B1 in vivo). 

Antl-correlated Inhibitory Activity Cancels Out Highly 
Correlated Excitatory Activity 

Many previous studies have struggled to explain the uncorre- 
lated neuronal spiking activity that is often observed in vivo 
(Oelikel et al., 2004; Mazurek and Shadlen, 2002), with some 
suggesting that it is the result of poorly correlated excitatory 
activity (as expected if excitatory neurons generate a rate 
code), while others argue that correlations in excitatory activity 
are cancelled out by anti-correlated inhibition (Beierlein et al., 
2000; Okun and LampI, 2008). A model developed to address 
this issue by Renart et al. (2010) shows that it is indeed theoret- 
ically feasible for anti-correlated inhibitory activity to cancel out 
highly correlated excitatory activity (see their Figure 3). To test 
this hypothesis, we therefore analyzed whether this phenome- 
non was evident during spontaneous activity in the digital recon- 
struction. Although the digital reconstruction was not specifically 
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Figure 17. Cell-Type Responses In Vivo and E-l Balance 

(A1) Cellular response types to simulated single-whisker deflection. Each subplot represents the activity of an individual cell, containing the raster plot aligned 
to simulated whisker deflection and the PSTH. Upon simulated whisker deflection, neurons increased their firing rate (ON cells), showed no change in firing rate 
(NR cells), or decreased their firing rate (OFF cells). (A2) Comparison of mean firing rates before and after whisker deflection plotted in logarithmic scale (2630 
excitatory and 550 inhibitory neurons). Empty symbols represent neurons showing no significantly different activity in both periods (NR cells), and filled symbols 
represent neurons showing significantly different (p < 0.05) activity (ON and OFF cells). (A3) Mean first-spike latencies of inhibitory (INH) and excitatory (EXC) 
neurons to simulated whisker deflection, defined by first spike occurrence within 30 ms after stimulation, mean over 200 trials, for all 31 ,346 neurons in the 
stimulated column. Each box plot represents median, interquartile, and range of latencies; crosses represent outliers (2.5 times interquartile range). 

(B1) Raster (top) of the spontaneous spiking activity of 500 excitatory (red) and inhibitory (blue) neurons under in-vivo-like conditions (100% depolarization and 
[Ca^^o = 1 -25 mM). Bottom curves show tracking of instantaneous population-averaged activities (transformed to z-scores, bin size 3 ms). Average firing rates of 
E and I cells were 1 .09 ± 1 .0 Hz and 6.00 ± 8.95 Hz, respectively (n = 1 ,000; mean ± SD). (B2) Histogram of spike-spike correlations (black, count window 50 ms) 
and of jittered spike trains (gray, jitter ± 500 ms). (B3) Population-averaged cross-correlograms of the somatic membrane current, when cells are held at the 
reversal potential of inhibition (blue) or of excitation (red) in both cells, or at one potential for one cell and at the other potential for the other cell (magenta). The 
black curve is for pairs at resting potential. 

See also Figure SI 8 for details of E-l balance. 



designed to produce this phenomenon, it nonetheless generated 
excitatory conductances in single neurons that were highly 
correlated but effectively cancelled out by anti-correlated inhib- 
itory conductances (Figure 17B). 

Deeper investigation revealed that spiking is correlated with 
momentary imbalances between excitatory and inhibitory con- 
ductances lasting <10 ms and that the timing of spikes can 
be predicted from the difference in the E and I conductances 
(Figure 17B). We also found that the precision with which 
these imbalances drive spiking falls dramatically as the network 
state shifts away from the transition in either direction. When it 



shifts toward the synchronous state, the correlation is strong 
but broad, resulting in a temporally imprecise increase in 
spiking more suitable for a rate code. When it shifts toward 
the asynchronous regime, the correlation is sharp but too 
weak to effectively drive spiking, a regime more suitable for a 
population code based on a high degree of correlated activity 
(Figure SI 8). 

Temporally Sequential Structure during Spontaneous 
Activity of L5 Neurons 

The search for precise temporal structures in brain activity, such 
as synfire chains, motifs, repeated spike patterns, etc., has a 



Cell 163 , 456-492, October 8, 2015 (s)2015 Elsevier Inc. 479 




Cell 



Spike triplets in the activated state 
(Luczak et al. 2007) 

A2 



Soloists & Choristers in the activated state 
(Okun era/. 2015) 

X40J 

X20j II 5s \ 




-48 48 

Latency 2 - Latency^ (ms) 
Latencyg - Latency^ (ms) 



100 200 300 

Time from activated state onset (ms) 



Figure 18. Triplet Structure and Diverse Population Coupling 

Precisely repeating triplet structures can be predicted from individual neural latencies under synchronous stimulation (20 thalamic fibers, [Ca^'^Jo = 1 .25 mM; cf. 
Luczak et al., 2007, Figure 5). 

(Al) Schematic depicting the structure of a spike triplet for a triad of neurons. (A2) Count matrix for a representative neuron triad. Black box indicates region 
containing precisely repeating triplets. White square signifies mode. (A3) Correlation between neural latency differences and triplet structures. (A4) Precisely 
repeating triplet probability peaks shortly after onset of activated state. This peak is significant when compared with two null hypotheses (independent Poisson 
model, blue curve; common excitability model, red curve). Dashed lines show standard deviation. 

(B1) Time course of population firing rate just below the transition to the synchronous regime (microcircuit “Bio5,” [Ca^^o = 1 -27 mM). (B2) (Top) Spike-triggered 
average of population activity (stPR) for four representative neurons in layers V and VI. (Bottom) Same as above but after shuffling (see Okun et al., 2015; cf. 
Figures 1 E and 1 G). (Inset) Distribution of the population-coupling coefficient before and after shuffling (see Okun et al., 201 5). (B3) Relative fractions of m-types of 
soloists (population coupling < 0.5) and choristers (> 2.0). 

See also Figure SI 9 for results under lower [Ca^^o. 



long history. These patterns are thought to reflect “stereotypical 
organized sequential spread of activation through local cortical 
networks,” as demonstrated recently (Luczak et al., 2007). Luc- 
zak et al. (2007) found a temporally sequential structure during 
spontaneous activity of L5 neurons in vivo in the somatosensory 
cortex (Luczak et al., 2007). In particular, they found that, after 
the onset of an UP state, trios of neurons generated spike motifs 
(triplets) with a precisely defined temporal relationship between 
spikes that could not be explained by random correlations during 
high-frequency spiking (see their Figure 5). A similar analysis of 
the evoked response to thalamic stimulation of L5 neurons in 
the digital reconstruction found the same repeating triplet struc- 
tures as observed in vivo (Figure 18A). A second in silico exper- 
iment further into the asynchronous regime (i.e., at lower Ca^^ 
levels; 1 .0 mM) showed no evidence of triplet structures (Figures 
S19A-S19C), supporting our prediction that, in the highly asyn- 
chronous regime, it is difficult for single neurons to track fine 
temporal structure in network activity unless the population of 



presynaptic neurons becomes highly synchronized, for example, 
by external input. 

Soloists versus Choristers 

A recent study showed that some neurons in a network display 
spiking activity that is tightly correlated with the average activity 
of the population of neurons in the network (choristers), while 
others display a diversity of spiking patterns whose correlation 
with that of the population is smaller than expected by chance 
(soloists), suggesting that they actively avoid correlating with 
the rest of the population (Okun et al., 2015). We simulated the 
spontaneous activity of a single microcircuit in the asynchronous 
state but close to the transition to synchronous state for 800 s 
(Figure 18B1; see also Figure S19D) and analyzed the spiking 
activity of every individual neuron in L5 and L6 with respect to 
the spiking of all others. Replication of the analysis in Okun 
et al. (2015) yielded comparable results, although the proportion 
of choristers appears to be somewhat higher in the digital recon- 
struction (Figure 1 8B2 in silico versus Figures 1 E and 1 G in vivo). 
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We found that soloists are predominantly interneurons, while 
choristers are mainly pyramidal neurons (Figure 18B3). Pyrami- 
dal cells can be found on both extremes; they tend to be soloists 
when their spontaneous firing rate is high, the ratio of excitatory 
to inhibitory synaptic innervation is high, and most of the inner- 
vating synapses are close to their somata (data not shown). 

Shifting the network further into the synchronous regime leads 
to an increase in the number of choristers, consistent with gen- 
eral recruitment of all neurons and a rate-based response. On 
the other hand, shifting the network further into the asynchro- 
nous regime results in a loss of both choristers and soloists dur- 
ing spontaneous activity (data not shown). This finding supports 
our prediction that, when the network is far into the asynchro- 
nous regime, single neurons cannot easily sense and respond 
to fine temporal structure in network activity. 

Functional Implications 

These replications of in vivo studies suggest that the digital 
reconstruction can yield physiologically relevant insights. We 
therefore went on to address two issues that it has not been 
possible to address experimentally, either in vitro or in vivo. 

In a first experiment, we investigated the ability of single L5 
pyramidal neurons to discriminate between spatially segregated 
inputs. As previously, we used stimuli that approximated a 
whisker deflection (Figure 19A). To measure how far apart the 
stimuli needed to be for single neurons of the microcircuit to 
discriminate between them, we progressively increased the 
spatial separation between the stimuli and measured the 
response of L5 pyramidal cells in terms of rate (represented by 
the number of spikes emitted) and timing (represented by the 
latency to first response). Analysis of the difference between re- 
sponses yielded a measure of latency- and rate-based discrim- 
ination. Figure 19B1 shows the responses of a single, arbitrarily 
selected neuron to stimuli applied at locations separated by 
150 |im. In this case, latency- and rate-based discrimination 
are both significant (Figures 19B2 and 19B3). Exploration of 
the discriminatory power of L5 pyramidal neurons with different 
separations between the stimuli and at different levels of Ca^"^ 
(Figures 19C1 and 19D1) showed that many neurons discrimi- 
nate between inputs separated by 150 iim or more and that a 
few can discriminate between stimuli with separations as small 
as 50 |im (i.e., approximately two minicolumns apart). In general, 
timing-based discrimination is much stronger than rate-based 
discrimination (Figures 19C2 and 19D2). Interestingly, at all sep- 
arations, discrimination is strongest at Ca^"^ levels close to the 
transition between the synchronous and asynchronous regimes 
(Figure 19D). 

Unexpectedly, we noticed a spatial asymmetry in the discrim- 
inatory power of the neurons (Figures 19C1 and 19C2, shaded 
background). To test the reproducibility of the phenomenon, 
we repeated the simulation using instantiations of Bio1-Bio5 
that we already knew to be highly variable (see Figure SI 7 and 
Table SI). All reconstructions showed asymmetry, but the spe- 
cific degree and pattern of asymmetry was different in each 
case (Figure 19E1, four instantiations shown). We therefore 
hypothesized that the asymmetry reflects local variations in con- 
nectivity arising from the statistical instantiation of the digital 
microcircuit, amplified by edge effects. To test this hypothesis. 



we repeated the discrimination experiment with a mesocircuit 
constructed as previously described ([Ca^^o 1-25 mM, separa- 
tion 150 |im, see shaded background in Figures 19C1 and 
19C2), taking the same microcircuit used in the previous ex- 
periment (Figures 19C1 and 19C2) as its central microcircuit 
(Figure 1 9E2, white hexagon). Under these conditions, the asym- 
metry was markedly reduced. We also found strong variation in 
overall discrimination power across the different instantiations 
(Figure 19E1). 

In the final series of simulations, we explored the relationship 
between the size of the network and its emergent properties, 
the emergence of the transition between the synchronous 
and asynchronous states, and the emergence of spontaneous 
spatio-temporal patterns for different sized networks (10- 
1,000 minicolumns). In reconstructions smaller than the 
anatomically defined microcircuit, the transition occurred at 
high levels of Ca^'^ and fell sharply with increasing size of the 
reconstruction, reaching a plateau in reconstructions larger 
than ~300 minicolumns (i.e., the size of the anatomically defined 
microcircuit; Figures 20A and 20B). Even in reconstructions 
as large as 1 ,000 minicolumns, the spectrum of states did not 
exhibit any further qualitative change (Figure 20B). Figures 20C 
and 20D show the emergence of spontaneous clustered activity 
as the network increases in size. We found that, in smaller recon- 
structions, the time course of spontaneous firing rates in different 
clusters of ~10 minicolumns was very similar and became pro- 
gressively dissimilar as the reconstructions increased in size (Fig- 
ure 20D). The between-cluster correlation coefficient decreased 
exponentially with increasing distance between clusters, also 
plateauing at distances comparable to the diameter of the 
anatomically defined microcircuit (~202 iim; Figure 20E). As a 
related measure, we also examined the trend in correlated activ- 
ity within a central set of ~50 minicolumns as the surrounding 
network increased in size. We found that, in larger networks, 
the correlation fell exponentially, bottoming out in microcircuits 
of ~300 minicolumns and larger (r = ~0.4; Figure 20F). 

DISCUSSION 

This paper presents a first-draft digital reconstruction of neocor- 
tical microcircuitry that integrates experimental measurements 
of neuronal morphologies, layer heights, neuronal densities, 
ratios of excitatory to inhibitory neurons, morphological and 
electro-morphological composition, and electrophysiology, as 
well as synaptic anatomy and physiology (see “Reconstruction 
Data” and Table S3). It has been validated against a spectrum 
of separate anatomical and physiological measurements not 
used in the reconstruction (see “Validation Data” and Table 
S3). The reconstruction provides predictions of a wide range of 
anatomical and physiological properties of the neocortical 
microcircuitry (Box 1). Simulation of the reconstruction shows 
a spectrum of emergent network activity states with a sharp 
transition from synchronous to asynchronous states. At this 
particular point along the spectrum, digital reconstructions 
reproduce a number of findings from in vivo studies, allowing 
deeper investigation of their underlying cellular and synaptic 
mechanisms. They also enable experiments that have not so 
far been possible either in vitro or in vivo. Investigation of the 
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Figure 19. Spatial Resolution 

(A) The ability of L5PCs in the microcircuit to discriminate between inputs given by bundles of ten thalamic fibers was examined. The stimuli were centered on 
locations offset from the center of the circuit to the left or right. (Top) Spatial extent of the synapses activated by ten thalamic fibers in the microcircuit. 

(B1) Raster plot of spiking activity of a neuron in response to ten repetitions of spatially constrained stimuli at different locations (1 s of pulses at 20 Hz). Black 
arrows on top indicate individual pulses. (B2) Histogram of the delay to the first spike after the start of stimulus presentation of the neurons in B1 . The difference in 
delay is statistically significant (p < 0.001 , Wilcoxon rank sum test). (B3) Histogram of the number of spikes during stimulus presentation (gray window in B1). 
Differences in spike counts were statistically significant. 

(Cl) Mean discrimination strength. -loglO of the p value as in B2 of L5PCs at different locations is indicated as color coded. Red and blue stars indicate the 
centers of the two stimuli to discriminate. Black dots indicate locations of individual L5PCs with a discrimination strength >2 (p < 0.01). Each row indicates a 
different extracellular Ca^^ concentration. (C2) Same, for the discrimination power based on spike count as in B3. 

(D) Fraction of L5PCs with a discrimination power >2 (light red) and >3 (dark red) for different conditions shown in C. Asterisks indicate instances in which the 
number of neurons with separation strength >2 is larger than can be explained as false positives (*p < 0.05; ***p < 0.001). 

(El) Discrimination power for a stimulus separation of 150 |im at 1 .25 mM [Ca^^o for four microcircuits based on biological datasets Bio2-Bio5. (E2) Discrimi- 
nation based on response delay and spike count when the same microcircuit was embedded in six surrounding microcircuits. 



size of network required to reproduce key functional properties 
of the microcircuit shows that it is roughly equivalent to the vol- 
ume of neocortical tissue used as the basis for the reconstruc- 
tion. This is evidence that a network of this size is the minimum 
functional unit required for neocortical information processing. 

Validity of the Digital Reconstruction 

The reconstruction certainly includes errors due to mistakes and 
gaps in experimental datasets and incomplete understanding of 



biological principles. For instance, additional cell type markers 
would improve the accuracy of the morphological composition, 
saturated EM reconstructions could be used to further validate 
the derived connectivity, more experiments reporting combined 
voltage and current measurements for synaptic responses will 
strengthen conclusions on quantal conductances and connec- 
tion-specific synaptic dynamics, and further characterization 
of the sensitivity of different synapses to [Ca^^o allow 
more accurate demarcation of the transitions between different 
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Figure 20. Emergence 

(A) Transition between regenerative and non- 
regenerative regimes as a function of circuit size 
and calcium concentration. Panels of raster plots 
(top) and PSTHs (bottom) of spontaneous activity 
are shown for a selection of circuit sizes and cal- 
cium concentrations (100% depolarization). 

(B) Overview of a broad range of circuit sizes and 
calcium concentrations as in A. Red crosses and 
green dots indicate regenerative and non-regen- 
erative circuit behavior, respectively, as assessed 
by visual inspection. Black curve depicts interpo- 
lated transition between regenerative and non- 
regenerative regimes. 

(C) Spatial profile of instantaneous firing rates for 
circuits of increasing size. Mean instantaneous 
firing rates were estimated for contiguous group- 
ings (clusters) of approximately ten minicolumns 
using a K-means algorithm. Six spatial firing rate 
profiles are shown, generated by interpolating 
these rate estimates (see Experimental Pro- 
cedures) at two selected times for three circuit 
sizes. Colored circles show five exemplary cluster 
centers. 

(D) Time traces of firing rates for selected clusters. 
Firing rate time courses are shown for the clusters 
in C in corresponding color for all three circuit sizes. 
Dashed boxes indicate the times, ti and t 2 , at 
which spatial profiles are compared in C. 

(E) Pairwise cross-correlation coefficients of cluster 
firing rate time courses for all cluster combinations 
versus inter-cluster separation for varying circuit 
sizes (50 to 1 ,000 minicolumns). Pair-wise corre- 
lation decays exponentially with distance (blue 
dashed line shows exponential fit to 1 ,000 mini- 
column circuit dataset, space constant \ = 202 ^im). 

(F) Mean pairwise cluster correlation coefficients 
versus circuit size for each circuit’s centermost 
five clusters (red circles) and for all clusters (blue 
circles). Error bars indicate SD. Dashed curves 
indicate exponential fits to respective data. 



activity states and may allow a more precise determination of 
the role played by each neuron and synapse type in maintaining 
and shifting regimes. For example, a recent EM study found 
evidence for a higher number of synapses per connection than 
predicted by a naive interpretation of Peters’ rule (Kasthuri 
et al., 201 5). Applying the same analysis to the digital reconstruc- 
tion produced comparable findings (Figure S20 in silico versus 
Figures 7D, 7F, and SOB in Kasthuri et al., 2015). These proper- 
ties emerge in the digital reconstruction as a consequence of 
preferential pruning of connections with low numbers of synap- 
ses (Reimann et al., 2015). 

The validation tests conducted at multiple stages of the recon- 
struction process reduce the risk that errors could lead to major 
inaccuracies in the reconstruction or in simulations of its emer- 
gent behavior. For example, validation of electrical neuron 
models against independent data insulates the emergent 
behavior of the network from the impact of our limited knowledge 
of ion channel kinetics and distributions. More generally, 
the reconstruction passed multiple tests broadly validating 



its underlying anatomy and physiology. For instance, major 
errors in cell morphology, densities, composition, and con- 
nectivity would make it difficult to reproduce the types of 
neuronal assemblies discovered in 12-patch experiments, the 
numbers of GABAergic synapses on pyramidal somata, protein 
staining patterns, layer-wise synapse densities, connection 
probabilities, bouton densities, and distributions, etc. (see 
Table S3). These properties lie well within experimentally re- 
ported ranges. The reproducibility of observations and pre- 
dictions in multiple reconstructions using data from different 
animals and incorporating statistical variations provide evidence 
that they are robust. 

Although the reconstruction is, to our knowledge, the most 
detailed to date, it omits many important details of microcircuit 
structure and function, such as gap junctions, receptors, glia, 
vasculature, neuromodulation, plasticity, and homeostasis. 
Furthermore, it represents a snapshot of just one brain region, 
in one strain of male rat, at a young age. This limits the gener- 
ality of the conclusions that can be drawn. For instance, in 
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Box 1. Microcircuit Predictions 



1 . The cellular composition of the microcircuit 

2. Total lengths of dendrites and local axons 

3. Increase in neuronal diversity with cortical depth 

4. Total number of appositions and synapses 

5. Total number of connections and connection types 

6. Number of connections and synapses per connection between different neuron types 

7. Number of connections and synapses formed by incoming fibers 

8. Increase in the E-l neuronal fraction with cortical depth 

9. All input and output synapses for all neuron types 

10. Quantal synaptic conductances for all intrinsic synapses 

1 1 . Total excitatory and inhibitory conductances for all neuron types 

12. Number and combination of pre- and postsynaptic neurons for all neuron types 

13. Detailed synaptic physiology for connections between all neuron pairs 

14. E-l ratios within and across layers 

15. A spectrum of network states ranging from synchronous to asynchronous activity 

16. Extracellular calcium regulates the network state through differential effects on synaptic dynamics 

17. Role of layers, neuron, and connection types in modulating network states 

18. The in vivo phenomena examined only emerge near the transition between synchronous and asynchronous states 



animals of the age used for the study, dendritic morphologies 
have already matured to adult levels (Larkman, 1991a; Romand 
et al., 2011), but the ascending axons may not be fully repre- 
sented and are certainly not completely mature (Romand 
et al., 2011). However, studies at a greater level of biological 
detail (e.g., including glia, receptors, and signaling pathways) 
and investigations of different brain regions in animals of 
different ages, gender, and species, as well as in disease 
models can use the reconstruction as a reference point. Find- 
ings consistent with the reconstruction would indicate the 
sufficiency of the principles of organization used in the recon- 
struction process; discrepancies may point to new principles. 
For example, if application of the connectivity algorithm to 
another brain region or to animals at a different age or belong- 
ing to a different species failed to yield results consistent with 
experimental findings, this would point to specific variations 
in the connectivity rules. 

Failure in validation could also indicate errors in experi- 
mental data. For instance, the reconstruction indicated that 
cell densities from a dozen previous studies were all too low 
to account for spine and synapse densities, suggesting new 
experiments, which verified this prediction. The reconstruction 
also revealed that many experiments underestimate synaptic 
conductances and suggests that in vitro experiments that do 
not account for calcium level in the bath may misinterpret 
the relevance of their findings for in vivo conditions. These ex- 
amples illustrate how the reconstruction process does not take 
experimental data at face value but uses complementary, 
related datasets to constrain the use as parameters, wherever 
possible. 

Functional Implications 

Simulations of the spontaneous and evoked activity that ac- 
counted for the differential sensitivity to Ca^"^ of different types 
of synapses and that explored changes in Ca^'^ levels revealed 



a spectrum of activity states ranging from synchronous to asyn- 
chronous behavior. Varying the Ca^"^ level profoundly changes 
the overall E-l balance and hence the position of the network 
along the spectrum, leading to a sharp transition between activ- 
ity regimes. These in silico predictions were verified by new 
in vitro experiments. 

Further simulations showed that the level of Ca^"^, where the 
transition occurred, varies across digital reconstructions that 
use data from different animals and that this accounts for a 
significant proportion of the variance in neuronal spiking and 
the spatial resolution of the network. We also found that a 
small adjustment in Ca^'^ levels (~0.05 mM) in individual recon- 
structions significantly reduces their physiological variability. 
These simulations provide an example of how variations in in- 
dividual neuroanatomy may lead to functional differences. 

Inspired by this finding, we performed further simulations, 
which demonstrated that activating or inhibiting specific layers, 
neurons, and synaptic connections also shifts the network 
along the spectrum. While it is well known from previous 
theoretical findings that changing E-l balance changes the state 
of the network (Brunei, 2000; van Vreeswijk and Sompolinsky, 
1996), the simulations further suggest that any mechanism 
that differentially changes the synaptic dynamics of different 
types of synapses (e.g., through neuromodulation; for reviews, 
see Lee and Dan, 2012; Zagha and McCormick, 2014) could 
alter the boundaries between activity regimes in complex 
ways. We speculate that other emergent properties, such as 
UP and DOWN states with two meta-stable fixed points, 
as observed in vivo (Steriade et al., 1993), which are not 
reproduced by the digital reconstruction, may require thalamo- 
cortical interactions (Hughes et al., 2002), cortico-cortical inter- 
actions (Timofeev et al., 2000), intrinsic oscillators (Lorincz 
et al., 2015; Sanchez-Vives and McCormick, 2000), or neuro- 
modulation (Constantinople and Bruno, 2011; Lorincz et al., 
2015; Sigalas et al., 2015). Modulation of cellular or synaptic 



484 Cell 163 , 456-492, October 8, 2015 ©2015 Elsevier Inc. 





Cell 



physiology may therefore serve as mechanisms to dynamically 
reconfigure the network to satisfy different computational 
requirements. 

Reproducing In Vivo Findings 

Although the digital reconstruction was largely based on in vitro 
data and was not designed to reproduce any particular experiment 
orto capture complex in vivo conditions, it yielded results that were 
qualitatively comparable to a number of major in vivo findings and 
made predictions beyond what was possible in these experiments, 
without tweaking any of the model parameters. 

For example, the digital reconstruction made it possible to 
address a long-standing question concerning the mechanisms 
underlying the uncorrelated activity frequently observed in 
in vivo experiments (Haider et al., 2006). Previous theoretical 
work has shown that uncorrelated activity could be the result 
of tightly correlated excitatory conductances that are effec- 
tively cancelled out by anti-correlated inhibitory conductances 
(Renart et al., 2010; van Vreeswijk and Sompolinsky, 1996). 
Our simulations, using a model not specifically designed to 
address this question, confirm this effect as an emergent prop- 
erty of the network. The simulations further suggest that 
cortical activity in vivo approaches a critical transition along 
the synchronous asynchronous spectrum, beyond which 
regenerative activity leads to neuronal avalanches (see also 
Beggs and Plenz, 2003). Around this transition, spiking activity 
is highly correlated with fine temporal structure in synaptic 
input, reflected in brief moments of imbalance between excit- 
atory and inhibitory conductances. Maximal discrimination 
between spatially segregated inputs, the generation of fine 
temporal structures such as triplets, and soloist-like and 
chorister-like behavior all emerge close to the transition. A 
recent study has experimentally characterized the plasticity 
mechanisms for maintaining the network close to this transition 
(Delattre et al., 2015). 

Reproducing these in vivo findings was surprising because 
the digital reconstruction was based on data and architectural 
principles obtained from the immature rat somatosensory 
cortex, while many of the in vivo findings came from different 
neocortical regions in adult animals, sometimes belonging 
to other species. The fact that the reconstruction reproduces 
these phenomena suggests that they arise from fundamental 
properties of the neocortical microcircuit. 

Concluding Remarks 

This study demonstrates that it is possible, in principle, to recon- 
struct an integrated view of the structure and function of neocor- 
tical microcircuitry, using sparse, complementary datasets to 
predict biological parameters that have not been measured 
experimentally. Although the current digital reconstruction can 
already be used to gain insights into the way the microcircuitry 
operates, it is only a first step. To facilitate integration of new 
experimental data and challenges to the principles on which 
it is based, we have created a public web resource, which 
provides access to experimental data, models, and tools used 
in the reconstruction (The Neocortical Microcircuit Collabora- 
tion [NMC] Portal, https://bbp.epfl.ch/nmc-portal; Ramaswamy 
et al., 2015). This will allow the community to integrate their 



own data, perform their own analyses, and test their own 
hypotheses. 

EXPERIMENTAL PROCEDURES 

A detailed description is available in the Supplemental Experimental 
Procedures. 

Data Acquisition 
Neuron Morphology 

Neuron morphologies were obtained from digital 3D reconstructions of 
biocytin-stained neurons from juvenile rat hind-limb somatosensory cortex, 
following whole-cell patch-clamp recordings in 300-|am-thick brain slices 
(Markram etal., 1997). In some of the reconstructed neurons, bouton locations 
were annotated on the axon (Wang et al., 2002). Reconstruction used the Neu- 
rolucida system (MicroBrightField). 

Neuron Electrophysiology 

Neurons were stimulated with a set of previously described protocols (Le Be 
et al., 2007; Wang et al., 2002, 2004). A subset of these stimuli was used 
to generate neuron models; a different subset was used to validate the 
models. 

Synaptic Anatomy 

Data on the anatomy of synaptic connections were collected from previous 
studies in which synaptically coupled neurons were digitally reconstructed, 
and putative synapses were identified using criteria identifiable in light micro- 
scopy and validated using EM. In brief, putative synapses were identified at 
appositions between arbors, where a bouton was also present on the axon 
of the presynaptic neuron (Markram et al., 1997). 

Synaptic Physiology 

Presynaptic neurons were stimulated with a set of previously described proto- 
cols (Gupta et al., 2000; Markram et al., 1998; Tsodyks and Markram, 1997; 
Wang et al., 2002, 2006). The synaptic parameters required to model 
the synapses were obtained by fitting the responses against the Tsodyks- 
Markram model for dynamic synaptic transmission (Fuhrmann et al., 2002; 
Tsodyks and Markram, 1997). 

Tissue immunohistochemistry 

Standard immunohistochemical methods were used to label markers of cell 
types (Lefort et al., 2009). Stained cells were counted under light microscopy. 
Layer boundaries and densities per layer were computed on slices using opti- 
cal dissectors on NeuN-stained tissue (West and Gundersen, 1990; Williams 
and Rakic, 1988) and Stereo Investigator software (Stereoinvestigator 7.0, 
MicroBright Field). Data for each cortical layer (I, II, III, IV, Va, Vb, VI) were 
collected from different animals (n = 5). Final values for neuronal densities 
and layer thicknesses were corrected for shrinkage. E/I ratios were determined 
by soma counting in confocal microscopy imaging of dual NeuN- and GABA- 
stained tissue. 

Electron Microscopy 

Serial EM stacks were obtained for blocks of neocortical tissue, as previously 
described (Denk and Horstmann, 2004). 

Multi-electrode Array Experiments 

A 3D multi-electrode array with 60 pyramidal platinum electrodes (Qwane 
Bioscience SA) was used to obtain extracellular recordings from neurons in 
slices, as previously described (Delattre et al., 2015; Rinaldi et al., 2008). 
Experimental data analysis was performed in Matlab (The MathWorks) with 
custom scripts. Extra-cellular spikes were detected when the recorded signal 
crossed a dynamic threshold. 

Manipulating [Ca^*]o 

Extracellular Ca^"^ concentration ([Ca^^o) was changed by bath perfusion with 
artificial extracellular fluid containing a modified [Ca^^o- Bath changing times 
were minimized by employing a pipette to remove the recording chamber 
solution prior to changing the subsequent solution. 

Reconstruction Process 
Digital Neuron Morphologies 

Following 3D reconstruction, the cut ends of neuronal morphologies were 
restored using a repair algorithm (Anwar et al., 2009). Neuronal arbors 
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were digitally unraveled to compensate for tortuosity caused by shrinkage, 
and neuron morphologies were cloned (see Supplemental Experimental 
Procedures). 

Electrical Neuron Models 

Multicompartmental conductance-based models of neurons were generated 
using up to 13 active ion channel types and a model of intracellular Ca^^ dy- 
namics. Axon initial segments (AIS), somata, basal dendrites, and apical 
dendrites were separated. Interneurons contained only one dendritic region. 
Each region received a separate set of channels (see NMC portal, https:// 
bbp.epfl.ch/nmc-portal; Ramaswamy et al., 2015). Of the axon, only the AIS 
was simulated. Each AIS was represented by two fixed-length sections, 
each with a length of 30 ^im. AIS diameters were obtained from the recon- 
structed morphology used for model fitting. Action potentials detected in the 
AIS were sent to the postsynaptic synapses with a delay corresponding to 
the axonal length, assuming an axonal velocity of 0.3 m/s. Neuron models 
were fitted using a feature-based multi-objective optimization method, as 
previously described (Druckmann et al., 2007). 

The Microcircuit Volume 

Layer thicknesses and the diameter of the microcircuit were used to construct 
a virtual hexagonal prism (see main text). A virtual slice was generated from a 
1x7 mosaic of microcircuits as a sheet (230.9 x 2800 ^irn). A meso-circuit 
was also generated. The meso-circuit consisted of a single microcircuit 
surrounded by additional microcircuits on all faces. 

Cellular Composition 

Cell density measurements and experimentally determined fractions of m- and 
me-types were used to generate the position of each cell in the volume of 
tissue, using E:l ratios to correct for sampling bias. Each cell was assigned 
the optimal morphology for its location in the volume (see Supplemental Exper- 
imental Procedures). 

Synaptic Anatomy 

Locations of synapses were derived using an algorithm described in the com- 
panion article (Reimann et al., 201 5). The algorithm eliminates appositions that 
do not comply with the multi-synapse and plasticity reserve rules and ensures 
compatibility with observed biological bouton densities. 

Synaptic Physiology 

Excitatory synaptic transmission was modeled using both AMPA and NMDA 
receptor kinetics (Fuhrmann et al., 2002; Hausser and Roth, 1997; Markram 
et al., 1998; Ramaswamy et al., 2012; Tsodyks and Markram, 1997). Inhibitory 
synaptic transmission was modeled with a combination of GABAa and GABAb 
receptor kinetics (Gupta et al., 2000; Khazipov et al., 1995; De Koninck and 
Mody, 1997; Mott et al., 1999). Stochastic synaptic transmission was imple- 
mented as a two-state Markov model of dynamic synaptic release, a stochas- 
tic implementation of the Tsodyks-Markram dynamic synapse model (Fuhr- 
mann et al., 2002; Tsodyks and Markram, 1997). Biological parameter 
ranges for the four model parameters were taken from experimental values 
for synaptic connections between specific m- and me-types or between larger 
categories of pre- and postsynaptic neurons (see Figure 9). Spontaneous 
miniature PSCs were modeled by implementing an independent Poisson 
process for each individual synapse that triggered release at rates (7spont) 
determined by the experimental data (Ling and Benardo, 1999; Simkus and 
Strieker, 2002). 

Thalamic Innervation 

Thalamic input was reconstructed using experimental data for ventro-poste- 
rior medial (VPM) axon bouton density profiles in rat barrel cortex (Meyer 
et al., 2010b), synapses per connection, and approximate numbers of 
incoming fibers. Synapse locations were determined using a variant of the 
connectome algorithm (Reimann et al., 2015; see Supplemental Experimental 
Procedures). Synapses were assigned to incoming fibers based on a Gaussian 
probability centered around each fiber. 

Simulation 

Microcircuit Simulation 

The reconstructed microcircuit was simulated using the NEURON simulation 
package, augmented for execution on the supercomputer (Flines and Carne- 
vale, 1997; Hines et al., 2008a, 2008b), together with additional custom tools 
to handle the setup and configuration of the microcircuit and the output of 
results. 



In Silico Experiments 

Depolarization was achieved by simulating current injection at the neuron 
soma. Currents were expressed as percent of first spike threshold for each 
neuron. Changes in [Ca^^o were simulated by changing the use parameter 
of synaptic transmission according to three curves for specific m-types (see 
Figure SI 5). Neuronal in silico knockout experiments were performed by 
hyperpolarizing the target population with somatic current injection (-100% 
threshold). Thalamic fiber stimulations were performed on circular clusters of 
minicolumns. The methods used to replicate previous in vivo experiments 
are described in the Supplemental Experimental Procedures. 

Data Analysis 

Anatomical and physiological data analysis were performed using a custom 
suite of Python-based tools operating on a Linux cluster (Supplemental Exper- 
imental Procedures). The same analysis as described in Kasthuri et al. (2015) 
was applied to compare results between a saturated EM reconstruction and 
the digital reconstruction that we generated (see Figure S20). PSPs were 
measured at the somata or dendrites of randomly selected pairs of neurons 
(30 trials). PSTHs were computed from all neurons in the circuit and were 
normalized by neuron number and time bin to express the average instanta- 
neous firing rate. Mean spike-spike correlations were calculated as the histo- 
gram of intervals between all spike times of two different cells (bin size 1 ms). 
Evoked regenerative activity was defined as activity in which peak activity 
(PSTH) within 100 ms after stimulus of the outermost 20 minicolumns ex- 
ceeded 30 Hz and 70% of the activity of the 20 central minicolumns in the 
10 ms after stimulus. Spike rasters show spike events at the locations within 
the layers where they occurred (for clarity, only a fraction of spikes are plotted). 
Supercomputing 

Reconstruction and simulation workflows, such as neuron model optimization, 
circuit reconstruction, and network simulation, were executed on super- 
computers. The systems used included an IBM Blue Gene/L (until 2009), a 
CADMOS 4-rack IBM Blue Gene/P (until 2013), a CADMOS 1-rack IBM Blue 
Gene/Q (until 2014), and the Blue Brain IV operated by the Swiss National 
Supercomputing Center (CSCS) on behalf of the Blue Brain Project, ranked 
the 100^*^ most powerful supercomputing system (Top500, June 2015). Blue 
Brain IV includes a 4-rack IBM Blue Gene/Q, IBM Blue Gene Active Storage, 
and a 40-node Linux cluster for post-processing, analysis, and visualization, 
fully interconnected using Infiniband technology and a GPFS file system with 
4.2 Petabyte raw storage (Schurmann et al., 2014). 

Visualization 

Large circuits and simulations in high resolution were visualized using a 
custom-developed tool, RTNeuron (Hernando et al., 2012). High-quality, static 
images of small neural circuits, individual neurons, and synaptic spines 
and boutons were created using Maya 3D animation software (Autodesk, 
San Rafael, California, USA). 

Software Development 

Data integration and post processing as well as reconstruction, simulation, 
analysis, and visualization of neuronal network models used >30 software 
applications, integrated into automated and semi-automated workflows. 
Development was supported by a comprehensive development environment 
based on best practices for version control (git), code review (gerrit), and 
continuous building, testing, packaging, and deployment (Jenkins). 

SUPPLEMENTAL INFORMATION 

Supplemental Information includes Supplemental Experimental Procedures, 
20 figures, six tables, and three movies and can be found with this article online 
at http://dx.d 0 i. 0 rg/l 0. 1 01 6/j.cell.201 5.09.029. 
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SUMMARY 

As a basic functional unit in neural circuits, each 
neuron integrates input signals from hundreds to 
thousands of synapses. Knowledge of the synaptic 
input fields of individual neurons, including the iden- 
tity, strength, and location of each synapse, is essen- 
tial for understanding how neurons compute. Here, 
we developed a volumetric super-resolution recon- 
struction platform for large-volume imaging and 
automated segmentation of neurons and synapses 
with molecular identity information. We used this 
platform to map inhibitory synaptic input fields of 
On-Off direction-selective ganglion cells (On-Off 
DSGCs), which are important for computing visual 
motion direction in the mouse retina. The reconstruc- 
tions of On-Off DSGCs showed a GABAergic, 
receptor subtype-specific input field for generating 
direction selective responses without significant gly- 
cinergic inputs for mediating monosynaptic cross- 
over inhibition. These results demonstrate unique 
capabilities of this super-resolution platform for 
interrogating neural circuitry. 

INTRODUCTION 

Mapping synaptic connectivity at multiple scales, ranging from 
the synaptic fields of individual neurons to the wiring diagram 
of the whole brain, is important for understanding how neural cir- 
cuits function and how circuit defects contribute to mental illness 
(Alivisatos et al., 2013; Morgan and Lichtman, 2013). An ideal 
platform for imaging synaptic connectivity should provide (1) 
high-resolution structural information for reliable identification 
of synaptic connections and accurate assignment of synapses 
to neurons; (2) the ability to image specific molecules, such as 
neurotransmitter receptors, important for determining synapse 
identity and properties; and (3) automated image segmentation 
capability for efficient analysis of large-volume reconstructions 
that capture entire neurons or circuits. 

Both fluorescence microscopy and electron microscopy (EM) 
have been used for volumetric neural circuit reconstruction 



(Helmstaedter, 2013; Kleinfeld et al., 2011; Lichtman and Denk, 
2011). EM provides exquisite spatial resolution and membrane 
contrast for accurate synapse identification, and the high imaging 
speed of modern EM instruments allows increasingly larger vol- 
ume reconstructions (Helmstaedter, 2013; Kleinfeld et al., 2011; 
Lichtman and Denk, 2011). However, because of the stringent fix- 
ation and sample preparation conditions required for high-quality 
EM imaging, labeling of endogenous synaptic proteins for deter- 
mining the molecular identities and functional properties of syn- 
apses remains a difficult task for large-volume EM reconstruc- 
tions. In addition, automated segmentation of EM images is still 
challenging and remains a bottleneck for scaling up neural circuit 
analysis, though substantial progress has been made on the 
development of automated EM image analysis and crowd-sourc- 
ing methods (Chkiovskii et al., 2010; Helmstaedter, 2013; Jain 
et al., 201 0). In comparison, fluorescence microscopy is compat- 
ible with immunohistochemistry and imaging of endogenous pro- 
teins over large volumes (Kleinfeld et al., 201 1 ; Miyawaki, 2015), 
and multi-colored fluorescence signals can also help simplify the 
task of automated image segmentation for efficient data analysis. 
However, the diffraction-limited resolution of fluorescence mi- 
croscopy can lead to substantial errors in the identification and 
assignment of synapses within reconstructed circuits. 

Super-resolution fluorescence imaging overcomes the diffrac- 
tion limit (Hell, 2007; Huang et al., 2010) and may enhance our 
ability to reconstruct neural circuits by integrating high image 
resolution for synapse identification and assignment, protein- 
specific labeling for determining the molecular properties 
of synapses, and multi-color imaging for efficient data analysis. 
Here, we developed a super-resolution reconstruction platform 
by combining stochastic optical reconstruction microscopy 
(STORM) (Huang et al., 2010; Rust et al., 2006) with serial ultra- 
thin sectioning for large-volume reconstruction of endogenous 
molecular targets in tissues and used this platform to image 
entire neurons and their synaptic inputs. We focused our studies 
on the inner plexiform layer (IPL) of the mouse retina where 
diverse classes of retinal ganglion cells (RGCs) integrate synap- 
tic inputs (Anderson et al., 2011; Helmstaedter et al., 2013) to 
generate unique spatiotemporal representations of the visual 
scene (Gollisch and Meister, 2010). A classic example of such 
a computation is the determination of visual stimulus motion di- 
rection by On-Off direction-selective RGCs (On-Off DSGCs) (Va- 
ney et al., 201 2). The substantial prior knowledge of the structure 
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and function of this cell type allows validation of our method, 
while unresolved structural questions in this system provide an 
opportunity to test the ability of our approach to extract novel 
biological information. For example, On-Off DSGCs are known 
to receive asymmetric inhibitory GABAergic inputs from presyn- 
aptic starburst amacrine cells (SACs) during null-direction stim- 
ulus movement (Briggman et al., 2011; Fried et al., 2002; Wei 
et al., 2011), and the a2 subunit of GABA(A) receptor plays an 
important role in this direction selectivity (Auferkorte et al., 
2012). In addition to GABAergic synapses, glycinergic signaling 
also impacts the response of On-Off DSGCs to the edges of 
moving stimuli (Caldwell et al., 1 978; Jensen, 1 999), likely reflect- 
ing crossover inhibition between the on and off sublaminae 
mediated by glycinergic amacrine cells (Kittila and Massey, 
1995; Stasheff and Masland, 2002; Werblin, 2010). However, 
the structural basis of this crossover inhibition in On-Off DSGC 
circuits is incompletely understood, and it is unclear whether gly- 
cinergic interneurons make direct synaptic contacts onto On-Off 
DSGCs. To demonstrate the capabilities of our super-resolution 
platform, we reconstructed the inhibitory synaptic input fields of 
individual On-Off DSGCs and determined the spatial distribution 
and neurotransmitter receptor identity of the synapses therein. 
We also reconstructed the inhibitory input fields of two other 
types of retinal neurons, a small-field On-center RGC and a nar- 
row-field amacrine cell, for comparative demonstration. 

RESULTS 

Volumetric, Multi-color Super-Resolution 
Reconstruction 

We labeled neurons and synaptic proteins with spectrally 
distinct photoswitchable dyes for multi-color STORM imaging 
(Dempsey et al., 2011). For neuron labeling, we used mice ex- 
pressing GFP or YFP in the cytoplasm of a sparse subset of 
retinal neurons (Feng et al., 2000) and labeled the dissected 
retinal tissue with anti-GFP antibodies. For marking inhibitory 
synapses, we used an antibody against an inhibitory synapse 
scaffolding protein, gephyrin, which anchors glycine and/or 
GABA receptors at postsynaptic terminals (Tyagarajan and Frit- 
schy, 2014). For presynaptic counter-staining, we used a cock- 
tail of antibodies against several active zone proteins, bassoon, 
piccolo, munc13-1 , and ELKS, for dense labeling of all presynap- 
tic terminals. Table S1 shows all of the antibodies tested in 
this work. We also included a general neuropil stain, wheat 
germ agglutinin (WGA), in a fourth color channel to produce 
images with dense information content to assist serial-section 
alignment. 

For volumetric reconstruction, we embedded tissues in resin 
and used serial ultrathin sectioning, in combination with STORM 
imaging, to generate large-volume super-resolution images. 
Serial ultrathin sectioning not only facilitates large-volume fluo- 
rescence reconstruction of tissue samples but also allows the 
image resolution along the z direction (as defined by the section 
thickness) to be substantially higher than the diffraction limit, as 
has been demonstrated previously in array tomography and 
three-dimensional (3D) STED reconstructions (Micheva and 
Smith, 2007; Punge et al., 2008). The partial exposure of epi- 
topes in samples embedded in acrylic resin also allows many 



different synaptic proteins to be imaged through multiple rounds 
of post-embedding immunolabeling, which help identify synap- 
ses and characterize their molecular properties (Micheva et al., 
2010; Micheva and Smith, 2007). However, the requirement of 
sample embedding for high-quality serial sectioning poses extra 
challenges for super-resolution imaging. Since STORM imaging 
relies on switching and localization of individual fluorophores to 
reconstruct super-resolution images (Huang et al., 2010; Rust 
et al., 2006), the resolution of a STORM image depends not 
only on the localization precision of individual fluorophores 
determined by their photon output but also on the localization 
density determined by the labeling density. Achieving optimal 
STORM resolution thus requires the labeling and embedding 
conditions to simultaneously retain optimal fluorophore proper- 
ties and high-density labeling in resin-embedded samples. Resin 
embedding, however, substantially reduces the antigenicity of 
samples, which leads to a drastic reduction in antibody labeling 
density and severely compromises the image resolution achiev- 
able by STORM as we observed for tissues immunolabeled after 
acrylic resin embedding. Such low label densities, which are also 
evident in previous STORM images of tissue samples prepared 
using a similar post-embedding labeling approach (Nanguneri 
et al., 2012), prevent accurate tracing of neurons and identifica- 
tion of synapses using super-resolution imaging. We therefore 
explored pre-embedding immunofluorescence labeling (Punge 
et al., 2008) to increase the labeling density. We further tested 
various embedding materials and found that epoxy resin was 
excellent for maintaining the photon output of the fluorescent 
dyes. Finally, since optimal photoswitching of dye molecules re- 
quires access to a switching agent, such as thiol, we chemically 
etched the resin-embedded tissue sections using sodium ethox- 
ide solution to expose the dyes to the thiol-containing imaging 
buffer. 

Experimentally, we immunolabeled retinal tissues, performed 
an additional fixation step to crosslink the antibodies, dehy- 
drated the samples, and embedded them in UltraBed epoxy 
resin (Figure 1A). The resin-embedded tissues were cut into 
70 nm ultrathin sections, arrayed onto glass coverslips, and 
etched with sodium ethoxide (Figure 1A). Coverslips were 
imaged using a microscope setup that allowed automated imag- 
ing of entire arrays of sections, and both STORM and conven- 
tional images were collected for the same tissue sections. The 
xy-resolution of the STORM images was ~20 nm, and that of 
the conventional images was diffraction limited to ~200- 
300 nm, whereas the z resolution of both STORM and conven- 
tional images in this work was limited by the section thickness 
of 70 nm. 

We developed an automated image analysis pipeline for pro- 
cessing STORM and conventional images, which included cor- 
rections of chromatic aberration and lens distortions using 
bead fiducials, as well as montage and serial-section alignment 
using scale-invariant feature transformation (SIFT) followed by 
elastic registration (Saalfeld et al., 2012) to generate large-vol- 
ume reconstructions (Figure 1A) (see the Experimental Proce- 
dures for details). Volumetric STORM reconstructions of the 
IPL revealed efficient labeling throughout the sample, with neu- 
rons situated amidst hundreds of thousands of fluorescent clus- 
ters in each synaptic channel (Figures 1B, 10, S1 , and S2). 
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Figure 1. A Super-Resolution Imaging and Analysis Platform 

(A) Tissues were dissected, fixed for immunohistochemicai iabeiing, postfixed, dehydrated, and embedded in epoxy resin. Uitrathin sections were cut, arrayed on 
giass coversiips, and etched to expose fluorophores for STORM imaging, individuai seriai sections were imaged and aiigned to generate 3D reconstructions. 

(B) STORM maximum intensity projection of a voiume (2.3 x 10^ i^m^) of the mouse iPL containing an On-Off DSGC (biue) amidst presynaptic (magenta) and 
gephyrin (green) ciusters imaged using the piatform. 

(0) An eniarged image of synapses in asmaii region (1 ^im thickness) of the iPL. For comparison, the corresponding conventionai image of the upper right portion 
(to the right of the dashed iine) is presented. 

See aiso Figures SI and S2. 



Synapse Identification 

Taking advantage of the multi-color super-resolution fluorescent 
signals, we developed image segmentation algorithms for auto- 
mated neuron and synapse identification and performed quanti- 
tative analysis of entire fields of molecularly identified synapses 
in our datasets. Labeled synaptic proteins appear as clusters of 
localizations in STORM images, but not all clusters in STORM 
images represent synapses (Dani et al., 2010; Specht et al., 
2013). For synapse identification, we measured the volume 
and signal density of all fluorescent clusters. In both presynaptic 
and postsynaptic (gephyrin) channels, these two parameters 
separated fluorescent clusters into two distinct populations (Fig- 
ure 2A). We assigned the population of clusters with larger 
volumes as putative “synaptic” (S) structures and the other pop- 
ulation with smaller volumes as putative “non-synaptic” (NS) 
structures. The vast majority (~91 %) of the putative synaptic ge- 
phyrin clusters had closely apposed presynaptic clusters (Fig- 
ures S3A and S3C), and example pairs of gephyrin and presyn- 
aptic clusters from this population clearly resembled synapses 
(Figure 2B), supporting our assignment. Of the putative non-syn- 
aptic population of gephyrin clusters, only a small fraction had a 
nearby presynaptic cluster (Figure S3A). Moreover, because 
some of these small gephyrin clusters were spatially close to 
the larger, paired gephyrin/presynaptic clusters and were 
thereby falsely identified as being paired, the automated pairing 
analysis of these small gephyrin clusters (Figure S3A) was less 
accurate than that for the larger synaptic clusters. Visual inspec- 
tion showed that ~90% of these small gephyrin clusters were 



unpaired and likely represent gephyrin-containing trafficking 
vesicles or background signals from non-specific antibody label- 
ing, whereas the remaining small fraction of paired structures 
could represent small (potentially immature) synapses. 

For the presynaptic clusters, even the synaptic population 
contained a substantial fraction of clusters (~70%) that were 
not paired with gephyrin clusters (Figures S3B and S3D). This 
is expected, as the cocktail of antibodies against presynaptic 
active-zone proteins should label the presynaptic terminals of 
both excitatory and inhibitory synapses, and excitatory presyn- 
aptic terminals would not be expected to pair with gephyrin. 
The population of small presynaptic clusters did not show any 
appreciable pairing with gephyrin clusters (Figure S3B). 

These analyses demonstrated that we could identify synapses 
based on the size and signal density of the fluorescent clusters 
observed in STORM images. In the following experiments, we 
focus our analysis on the population of synaptic clusters with 
larger volumes. In contrast, similar analysis of either gephyrin 
or presynaptic clusters observed in the corresponding conven- 
tional images did not allow clear distinction between synaptic 
and non-synaptic clusters (Figure S4A). 

Examination of the hundreds of thousands of automatically 
identified synapses in STORM images of the inner retina 
showed non-uniform distributions across the depth of the IPL 
(Figure 2C). The difference between gephyrin-paired (inhibitory) 
and unpaired (putative excitatory) presynaptic cluster inten- 
sities divided the IPL into several sublaminae, two of which 
coincided with On-Off DSGC stratification in sublaminae S3 
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Figure 2. Automated Inhibitory Synapse Identification within the IPL 

(A) Gephyrin (left) and presynaptic (right) clusters across the IPL can be separated into putative synaptic (S) and non-synaptic (NS) populations based on the 
volumes and signal densities of the clusters. Shown are the 2D distributions of cluster volume and signal density constructed from all gephyrin and presynaptic 

(legend continued on next page) 
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and S7 (Figure 2D) (Vaney et al., 2012). Interestingly, the ge- 
phyrin and presynaptic signal intensities and the density and 
volumes of these gephyrin-positive inhibitory synapses all 
peaked in S3 and S7 (Figures 2E and 2F). As the size of inhib- 
itory synapses correlates with synaptic strength (Urn et al., 
1999; Nusser et al., 1997, 1998), this observation suggests 
that the inhibitory synapses subserving On-Off direction-selec- 
tivity may be among the strongest inhibitory connections in the 
mouse retina. 

Identifying Inhibitory Synaptic Inputs to Labeled 
Neurons 

To demonstrate the ability of our super-resolution platform to 
segment and analyze synaptic inputs onto identified neurons, 
we reconstructed two types of retinal ganglion cells and their 
associated inhibitory synaptic fields. Each of these datasets 
consisted of both STORM images (Figure 3A, left) and, for com- 
parison, the corresponding conventional images (Figure 3A, 
right). To identify synaptic inputs onto neurons, we measured 
the density of gephyrin clusters and associated presynaptic sig- 
nals as a function of distance to the neuron surface. Both density 
functions derived from STORM images were sharply peaked 
near the neuron surface with the gephyrin peak slightly inside 
the neuron and the presynaptic signal slightly outside the neuron 
as expected for input synapses (Figure 3B). These density 
peaks, in particular the gephyrin peak, were followed by a deple- 
tion zone, where the density dropped below the mean density of 
the surrounding IPL. For automated assignment of synapses to 
the neuron, we set a cutoff at the point where the gephyrin den- 
sity dropped below the mean density of the surrounding IPL and 
selected only those gephyrin clusters located at a distance 
below the cutoff as synaptic inputs to the neuron (Figures 3B 
and 3D, left, and Movie S1). 

Figure 3E and Movie S2 show the 1,017 inhibitory synapses 
assigned to a reconstructed On-Off DSGC. The number of syn- 
apses that we identified by STORM reconstruction here was 
similar to that estimated by previous EM reconstructions of 
SAC inputs to On-Off DSGCs (Briggman et al., 2011). Moreover, 
more than 98% of the synaptic gephyrin clusters assigned to the 
neuron had an apposing presynaptic partner. All of the gephyrin- 
presynaptic pairs assigned to the neuron were spatially oriented 
with the presynaptic structure more distant from the neuron than 
the postsynaptic structure (Figure 30), which is consistent with 
these structures being input synapses onto the neuron. 
Together, these results further demonstrated the high accuracy 
in our synapse identification and assignment. 



In comparison, assignment of synapses to neurons based on 
the corresponding conventional fluorescence images was less 
precise as the diffraction-limited resolution made it difficult 
both to identify synaptic clusters and also to set a proper cutoff 
value for assigning clusters to the neuron (Figure S4). As a result, 
this analysis resulted in substantial error rates (up to ~50%), de- 
pending on the selected cutoff distance (Figures 3D and S4C). 

Distribution of Inhibitory Inputs to On-Off DSGCs 

We next evaluated the size and position of all gephyrin-positive 
synapses within the dendritic arbor of each reconstructed cell. 
On-Off DSGCs, such as those shown in Figure 4A, exhibited 
non-random synapse distributions on both local and whole-cell 
scales (Figures 4B-4E). A Ripley’s clustering analysis showed 
that synapses were significantly more depleted within ~1 [im 
of another synapse than would be predicted by a random distri- 
bution on the dendritic arbor (Figures 4B and S5A), likely reflect- 
ing a minimum inter-synapse spacing imposed by the finite size 
of each synapse, which is consistent with a previous observation 
(Bleckert et al., 2013). On the whole-cell scale, On-Off DSGCs 
exhibited sublaminar specificity with substantially higher syn- 
apse density in sublaminae S3 and S7 than in other sublaminae, 
even after normalization for the different surface areas of den- 
drites across the IPL depth (Figures 40 and S5B). This pattern 
is consistent with the specific innervation of On-Off DSGCs by 
SACs (Vaney et al., 2012), which also stratify in S3 and S7. 

Distribution of Inhibitory Inputs to a Small-Field 
On-Center RGC 

For comparison, we examined the sizes and spatial distribution 
of inhibitory synapses (936 total) onto a small-field On-center 
RGC (Figures 4F and 4G-4J and Movie S3), a putative type G6 
as previously classified (Volgyi et al., 2009). Similar to On-Off 
DSGCs, synapses on this cell also exhibited a non-random 
spatial distribution on the local scale where Ripley’s clustering 
analysis showed an ~1-2 |im depletion zone in the vicinity of 
each synapse (Figure 4G). However, the inhibitory synaptic input 
field of this neuron exhibited less sublaminar specificity than 
On-Off DSGCs on a whole-cell scale (Figure 4H). 

Receptor Identity of Inhibitory Inputs to On-Off DSGCs 

To demonstrate the capability of this super-resolution fluores- 
cence reconstruction platform to determine the molecular 
identities of synaptic connections within neural circuits, we per- 
formed experiments to disambiguate different inhibitory synaptic 
input classes (GABAergic versus glycinergic) onto identified 



clusters identified in the image block, with the cluster volume plotted on the log scale. The signal density is defined as the fraction of the volume occupied by the 
cluster that is positive for the gephyrin or presynaptic signal. 

(B) Six example pairs of gephyrin (green) and presynaptic (magenta) clusters. The synapses are rotated to show side and en face views. 

(C) Projection images of the IPL showing gephyrin-paired and unpaired synaptic clusters in the presynaptic channel. GCL: ganglion cell layer; INL, inner nuclear 
layer. 

(D) The laminar distributions of the gephyrin-paired and unpaired presynaptic clusters divide the IPL into several sublaminae. Gray bars: presynaptic pairing index 
as a function of IPL depth. “Presynaptic pairing index” is calculated as the difference of the paired and unpaired presynaptic laminar intensity distributions after 
first standardizing each distribution to have a mean of zero and a SD of one. Blue bars: volume of the On-Off DSGC (|am^) per 0.5 ^im bin as a function of IPL depth. 

(E) The total signal intensity (left), average cluster density (middle), and average cluster volume (right) for each cubic micron of imaged tissue measured as a 
function of depth within the IPL for gephyrin clusters that are paired with presynaptic clusters. 

(F) Similar to (E) but for presynaptic clusters that are paired with gephyrin clusters. 

The delineation of sublaminae S3 and S7 in (E) and (F) was determined based on (D). See also Figure S3. 
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Figure 3. Automated Segmentation of Synaptic Inputs to Neurons 

(A) STORM maximum projection image of a region containing a dendritic 
branch of a reconstructed On-Off DSGC (ieft) and the corresponding con- 
ventionai image (right). Neurite is in biue, gephyrin in green, and presynaptic 
channei in magenta. 

(B) The densities of the gephyrin ciusters that are paired with presynaptic 
ciusters (green trace), the unpaired gephyrin ciusters (biue trace), and the 
gephyrin-paired presynaptic signai (magenta trace) measured as a function of 
the distance to the neuron surface. The distance at which the density peak of 
gephyrin ciusters drops beiow the mean synapse density of the surrounding 
I PL (dashed green iine) is used as a cutoff for defining gephyrin ciusters on the 
neuron. 

(0) For each synapse, we measured the distances of the presynaptic and 
postsynaptic signai to the neuron surface and defined the difference between 
these two distances as the reiative presynaptic-gephyrin distance from the 
neuron surface. Aii synapses assigned to the On-Off DSGC show positive 



neurons. We labeled retinae with either an antibody against the 
a2 subunit of the GABA(A) receptor (GABA(A)Ra2) or an antibody 
cocktail against all alpha subunits of glycine receptors (GlyRal- 
4), in addition to antibodies against GFP and gephyrin for 
marking neurons and inhibitory synapses, respectively. To deter- 
mine whether each gephyrin-positive inhibitory synapse con- 
tained GABA(A)Ra2 or glycine receptors, we examined whether 
the corresponding synaptic gephyrin cluster was paired with a 
specific receptor cluster by using the same approach described 
above for pairing presynaptic and postsynaptic structures (Fig- 
ures S6A and S6B). 

In GABA(A)Ra2-labeled samples, On-Off DSGC dendrites 
contained many GABA(A)Ra2-paired gephyrin clusters but strik- 
ingly rare unpaired gephyrin clusters (Figures 5A, 5B, and S6C). 
The gephyrin and GABA(A)Ra2 signal intensities in these synap- 
ses were strongly correlated with a Pearson coefficient of 0.82 
(Figure S6E), suggesting that gephyrin intensity in these synap- 
ses correlates with synaptic strength, as is the case elsewhere 
in the nervous system (Urn et al., 1999; Nusser et al., 1997, 
1998). Quantitatively, 97% ± 1 % of the gephyrin-positive synap- 
ses on On-Off DSGCs contained GABA(A)Ra2, suggesting a 
high labeling efficiency of the receptors. Compared with synap- 
ses on On-Off DSGCs, only ~45% of all gephyrin-positive syn- 
apses analyzed across the IPL contained GABA(A)Ra2, demon- 
strating a strong enrichment of GABA(A)Ra2 in the synapses 
onto On-Off DSGCs. Although not all GABA receptor types are 
anchored at synapses by a gephyrin scaffold (Brickley and 
Mody, 2012; Tretter et al., 2012; Tyagarajan and Fhtschy, 
2014), gephyrin-independent GABA receptors are unlikely to 
contribute to direction selectivity (Brickley and Mody, 2012; 
Massey et al., 1997). This, together with the similar synapse 
counts observed between our experiments and previous EM re- 
constructions of SAC synapses onto On-Off DSGCs (Briggman 
et al., 2011), suggests that the vast majority, if not all, of the inhib- 
itory synapses onto On-Off DSGCs are gephyrin positive. Hence, 
our observations suggest that nearly all of the inhibitory synap- 
ses onto On-Off DSGCs contain the GABA(A)Ra2 subunit. 

In stark contrast to the GABA(A)Ra2-labeled samples, in the 
GlyRa1-4 labeled samples, we observed very few GlyRa1-4 
positive synaptic gephyrin clusters on On-Off DSGCs (Figures 
50, 5D, and S6D). Quantitatively, only 8% ± 4% of the synaptic 



relative distance values (solid line), which is consistent with these pairs being 
input synapses onto the neuron. In contrast, the spatial arrangement of nearby 
synapses within 500 nm of the neuron (dashed line) shows a broad distribution 
of both positive and negative relative distance values, indicating a random 
orientation of nearby synapses with respect to the neuron surface. 

(D) Assignment of synapses in the STORM image based on the cutoff selected 
in (B) reveals adjacent presynaptic and postsynaptic structures associated 
with the neuron (left). In contrast, assignment of synapses in the conventional 
images with a cutoff at 0 nm (middle) or 150 nm (right) show false-negative 
(arrows) and false-positive synapse assignments (arrowheads). 

(E) En face view (top) and side view (bottom) of the STORM maximum intensity 
projection of a reconstructed On-Off DSGC (blue) with associated synaptic 
gephyrin (green) and presynaptic (magenta) clusters. Although gephyrin and 
presynaptic clusters are clearly resolved in the original reconstruction (Fig- 
ure 3D and Movie SI), they appear as overlapping white dots here due to 
image downsampling. 

See also Figure S4 and Movies S1 and S2. 



6 Cell 163 , 1-13, October 8, 2015 ©2015 Elsevier Inc. 







Please cite this article in press as: Sigal et al., Mapping Synaptic Input Fields of Neurons with Super-Resolution Imaging, Cell (2015), http:// 
dx.doi.org/1 0.1 01 6/j.cell.201 5.08.033 



Cell 





Laminar distribution 



Radial distribution 



Angular distribution 




Distance along neurite (^m) 
1D-Ripley’s clustering 



Dendritic arbor depth (pm) 



Laminar distribution 




w 0) 
o in 
in Q. 
Q. (0 
(0 c 
c >. 
>» in 
<0 ^ 




Radial distribution 



Angular distribution 




Distance from center (pm) 




Figure 4. Distributions of Inhibitory Synapses on an On-Off DSGC and a Small-Field On-Center RGC 

(A) Surface renderings of the On-Off DSGC (gray) shown in Figure 3E with aii inhibitory synaptic inputs marked by circies whose coior (biue to red) reflects gephyrin 
cluster intensity on a log scale. 

(B) A one-dimensional Ripley’s clustering analysis along the path of the skeletonized neuron. Negative value of the Ripley’s function K{t) - u at short inter- 
synaptic distances indicate that, near any given synapse, the density of other synapses is significantly lower than a random distribution (see Experimental 
Procedures for the definition of Ripley’s K function). 

(C-E) The laminar (0), radial (D), and angular (E) distributions of the inhibitory synapse densities on the On-Off DSGC. 

(F) Surface renderings of a small-field On-center RGC (gray) with all inhibitory synaptic inputs marked by circles whose color (blue to red) reflects gephyrin cluster 
intensity. 

(G-J) Similar to (B-E) but for the On-center RGC. 

Pink regions in (B-E) and (G-J) reflect 5/95% confidence intervals of random distributions derived from 1 ,000 randomizations of the synapse positions. 

See also Figure S5 and Movies S2 and S3. 



gephyrin clusters on On-Off DSGCs contained any GlyRa1-4 
signal, and even these synapses exhibited extremely sparse 
GlyRa1-4 labeling relative to nearby glycine-positive synapses 
not on the labeled On-Off DSGCs (Figures S6F and S6G). Since 
these nearby synapses contained substantial GlyRa1-4 signal, 
the lack of GlyRa1-4 in the On-Off DSGC synapses could not 
be attributed to low receptor labeling efficiency. Moreover, while 
previous work has shown a strong correlation between glycine 
receptor and gephyrin expression at synapses (Specht et al., 
2013), we observed little correlation between the intensity of ge- 
phyrin and GlyR signals for these GlyR-positive gephyrin clusters 
on the On-Off DSGCs (Figure S6E). These results suggest that 
these sparse, low-intensity GlyR punctae probably reflect non- 
specific background labeling, and even if they were specific 
synaptic labeling, they would contribute relatively little synaptic 
current due to the low receptor abundance. As gephyrin is 



required for clustering glycine receptors at synapses (Feng 
et al., 1998; Fischer et al., 2000; Kirsch et al., 1993), our results 
thus indicate that On-Off DSGCs in the mouse retina receive little 
monosynaptic glycinergic input. 

In contrast to the STORM results, analysis of the correspond- 
ing conventional fluorescence images showed that a substantial 
population (20%-30%) of the gephyrin-labeled “synapses” as- 
signed to On-Off DSGCs were GABA(A)Ra2 negative (Figure S7). 
These errors arise primarily from two sources: (1) it is difficult to 
separate synaptic gephyrin clusters from non-specific back- 
ground labeling or trafficking vesicles containing gephyrin based 
on conventional images (Figure S4A), and hence some of the ge- 
phyrin clusters assigned to the neuron may not correspond to 
synapses; and (2) synapses near the neuron, but not on the 
neuron, can be mistakenly assigned to the neuron because of 
the limited resolution of the conventional images (Figure S4C). 
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Figure 5. Receptor Identity of the Inhibitory 
Synaptic Inputs to On-Off DSGCs 

(A) Top: surface rendering of a central cross-section 
of an On-Off DSGC (gray) with GABA(A)Ra2-positive 
(+) and GABA(A)Ra2-negative (-) inhibitory synap- 
ses marked as magenta and green circles, respec- 
tively. STORM image of the boxed region is shown in 
the bottom panel. Neuron: blue. Gephyrin: green. 
GABA(A)Ra2: magenta. 

(B) The GABA(A)Ra2-paired gephyrin (green), ge- 
phyrin-paired GABA(A)Ra2 (magenta), and unpaired 
gephyrin cluster (blue) densities as a function of the 
distance to the neuron shown in (A). 

(0) Top: surface rendering of a central cross-section 
of an On-Off DSGC (gray) with GlyRal -4-positive (+) 
and GlyRal -4-negative (-) inhibitory synapses 
marked as magenta and green circles, respectively. 
STORM image of the boxed region is shown in 
the bottom panel. Neuron: blue. Gephyrin: green. 
GlyRal -4: magenta. 

(D) The GlyRal -4-paired gephyrin (green), gephyrin- 
paired GlyRal -4 (magenta), and unpaired gephyrin 
cluster (blue) densities as a function of the distance 
to the neuron shown in (0). 

See also Figures S6 and S7. 



Inhibitory Inputs and Outputs of a Glycinergic 
Interneuron 

Last, we imaged gephyrin-positive inhibitory synapses associ- 
ated with a subtype of narrow-field amacrine cell (NFAC) (Fig- 
ure 6A and Movie S4), putatively a Type 7 based on previous 
characterization (Pang et al., 2012). NFACs mediate crossover 
inhibition between On and Off sublaminae of the IPL via glyci- 
nergic inhibition (Werblin, 2010). In contrast to On-Off DSGCs, 
the surface of this NFAC was highly enriched with paired 
GlyRal -4 and gephyrin clusters but largely depleted of un- 
paired, GlyRal -4-negative gephyrin clusters (Figures 6A and 
6B). The resolution of STORM allowed us to visualize the orien- 
tations of gephyrin-receptor pairs relative to the neuron surface 
and determine whether these structures were input synapses 
onto the cell or output synapses from the cell (Figure 60). Un- 
like GABAergic synapses onto On-Off DSGCs, which were all 
input synapses (Figures 6D and 30), the glycinergic synapses 
on the NFAC contain both input and output synapses (Figures 
60 and 6D). Both synaptic inputs and outputs exhibited subla- 
minar specificity with enrichment in the Off sublaminae (Fig- 
ure 6E), suggestive of this cell being an On-center responsive 
Type 7 glycinergic amacrine cell (Pang et al., 2012) providing 
crossover inhibitory output to the Off sublaminae (Werblin, 
2010 ). 

About 85% of the synapses on this neuron contained glycine 
receptors (Figure 6E), again indicating a high receptor labeling 
efficiency in our samples. Since NFACs are glycinergic cells, it 
is not surprising that the observed output synapses from this 
cell were mostly GlyRal -4 positive. It is, however, interesting 
to observe that the majority of gephyrin-positive inputs onto 
this cell were also GlyRal -4 positive, suggesting that this type 
of NFAC receives inhibitory input signals mainly from other gly- 
cinergic amacrine cells, though our results do not exclude the 
possibility that this cell type also receives some GABAergic 
inputs. 



DISCUSSION 

Mapping the spatial organization and molecular identity of syn- 
aptic connections within neuronal networks is important for 
understanding how the nervous system functions. Here, we 
developed a super-resolution platform for volumetric recon- 
struction and automated segmentation of endogenous molecu- 
lar targets in tissue and demonstrated the ability of this platform 
to identify the spatial patterns and molecular identity of inhibitory 
synapses within neuropil, as well as onto individual neurons us- 
ing the mouse retina as a model system. 

This method provides several benefits for reconstructing syn- 
aptic connectivity. First, the superior resolution of this approach, 
as compared to conventional fluorescence imaging, allows more 
accurate identification of synapses and assignment of synapses 
to neurons. Indeed, when comparing results from the same tis- 
sue samples, we found that conventional fluorescence imaging 
led to substantial errors both in the identification of synapses 
and in the assignment of synapses to neurons even with the 
improved z resolution afforded by ultrathin sectioning. These er- 
rors resulted in misidentification of inhibitory synaptic types onto 
On-Off DSGCs, which could lead to substantial misinterpretation 
of cellular physiology. In addition, the resolution provided by 
STORM also allowed us to quantitatively measure synapse 
size, which is often a good indicator of synaptic strength (Nusser 
et al., 1997, 1998). This ability allowed us to map the relative 
strengths of inhibitory synapses at different locations both on 
identified neurons and across the IPL. 

A second benefit of the super-resolution reconstruction plat- 
form is its ability to use standard immunohistochemistry for 
labeling multiple endogenous protein targets of interest, which 
allows the determination of the molecular identities of synapses. 
Such information is difficult to ascertain using EM reconstruc- 
tions alone but is important for interpreting the function of spe- 
cific synapses in neural circuits (Bargmann and Marder, 2013). 



8 Cell 163 , 1-13, October 8, 2015 ©2015 Elsevier Inc. 





Please cite this article in press as: Sigal et al., Mapping Synaptic Input Fields of Neurons with Super-Resolution Imaging, Cell (2015), http:// 
dx.doi.org/1 0.1 01 6/j.cell.201 5.08.033 



Cell 



A 



B 



D 



GlyRa1-4 (+) inhibitory synaptic input 
GlyRa1-4 (+) inhibitory synaptic output 



GlyRa1-4(-) inhibitory synapse 




Paired Paired Unpaired 
gephyrin receptor gephyrin 




Distance from neuron surface (nm) 



c 



Inputs 
Gephyrin I 
GlyRa1-4 | 
Neuron 'I 



Outputs 
Gephyrin .1 
GlyRa1-4 (I 




GlyRa1-4(+)-NFAC 

GABA(A)Ro2(+) - DSGC 




-100 0 100 

Relative receptor-gephyrin 
distance from neuron surface (nm) 




INL Depth of IPL (%) GCL 



Figure 6. Input and Output Inhibitory Synapses of a NFAC 

(A) Surface rendering of a NFAC (gray) with GiyRa1-4 (+) input (purpie circies), output (orange circies), and GiyRa1-4 (-) synapses (green circies) shown. 

(B) The GiyRal -4-paired gephyrin (green), gephyrin-paired GiyRal -4 (magenta), and unpaired gephyrin ciuster (biue) densities as a function of the distance to the 
surface of the neuron. 

(C) Exampies of input and output synapses distinguished by the positions of receptor and gephyrin signais reiative to the surface of the neuron, input synapses 
have the gephyrin ciusters (green) in the dendrites (biue) and receptor ciusters (magenta) on the surface. Output synapses have the receptor ciuster immediateiy 
adjacent to the dendrite surface and the gephyrin ciusters farther outside. 

(D) The reiative dispiacement of receptor and gephyrin ciusters from the neuron surface, with positive vaiues indicating receptor being farther from the neuron 
(input synapses) and negative vaiue indicating gephyrin being farther from the neuron (output synapses). The soiid iine shows the distribution for giycinergic 
synapses associated with the NFAC, and the dashed iine shows the distribution for the GABAergic synapses associated with an On-Off DSGC. 

(E) Laminar distributions of the input (purpie) and output (orange) GiyRal -4-positive synapses and the GiyRal -4-negative synapses (biue) on the NFAC. 

See aiso Movie S4. 



Taking advantage of this capability, we showed that gephyrin- 
positive inhibitory synapses onto On-Off DSGCs were over- 
whelmingly GABAergic and each contained the GABA(A) 
receptor a2 subunit, suggesting that this receptor subunit is 
important for generating postsynaptic currents during motion 
detection. This result is consistent with previous data showing 
the enrichment of GABA(A)a2 in On-Off DSGC synapses and 
reduction in direction-selective responses in the GABA(A)a2 
knockout mouse (Auferkorte et al., 2012). Our reconstructions 
also showed that On-Off DSGCs receive little, if any, monosyn- 
aptic giycinergic input. These structural data, together with the 
observations that blocking GABA receptors largely eliminates 
inhibitory currents in On-Off DSGCs (Stafford et al., 2014; Tren- 
holm et al., 2011), suggest that giycinergic modulation of On-Off 
DSGCs does not occur via direct giycinergic inputs onto these 
neurons but likely through giycinergic inhibition of bipolar cells 
or SACs that are presynaptic to On-Off DSGCs (Ishii and Ka- 
neda, 2014; Majumdar et al., 2009; Zhang and McCall, 2012). 

A third strength of this reconstruction platform is its ability to 
perform automated segmentation of synaptic connections in 
neural circuits without manual annotation. This automated anal- 
ysis capability greatly speeds up the image processing required 
to extract biological information from individual reconstructions. 



For example, the image processing for volumetric reconstruction 
and segmentation of a whole On-Off-DSGC cell and associated 
synapses took <3 days of computation time without any need for 
manual segmentation or correction. In this work, the rate-limiting 
step of our reconstructions was the STORM image acquisition 
time, as imaging an entire On-Off-DSGC of 2.3 x 10^ |im^ in 
four color channels took ~3 weeks using a STORM setup equip- 
ped with an EMCCD camera. Our recent switch to a scientific 
CMOS (sCMOS) camera with a larger field of view and higher 
frame rate (Huang et al., 2013) shortened the imaging time of a 
comparable volume to ~3 days. We envision this automated im- 
aging and segmentation pipeline to be beneficial for determining 
neural circuit properties in different genetic mutant and disease 
models or at different time points during development, where a 
large number of reconstructions are needed. 

One potential limitation of this super-resolution fluorescence 
platform, as compared with EM approaches, is the density of 
neuronal processes that can be reconstructed within a volume. 
In this work, we reconstructed the spatial distributions and mo- 
lecular identities of synapses onto individual neurons in Thy1- 
GFP/YFP transgenic mice, in which only sparse subsets of 
neurons are labeled. We expect that our approach can be 
extended to the reconstruction of multiple, synaptically coupled 
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neurons using recently developed high-density, high-antigenic- 
ity, genetic labeling approaches (Cai et al., 2013; Loulier et al., 
2014; Viswanathan et al., 2015) or by microinjection of probes 
to directly label multiple neurons. Although the image resolution 
here was limited in the z direction by the 70 nm section thickness, 
we anticipate a substantial improvement in z resolution by using 
3D STORM (Huang et al., 2008). In particular, using high-preci- 
sion z-localization approaches (Jia et al., 2014; Shtengel et al., 
2009; Xu et al., 2012), the optical resolution can reach ~10 nm 
in all three dimensions. However, this resolution is still lower 
than that achievable by EM, and the labeling density may impose 
an additional limitation on resolution. Together, these may limit 
the density of neurites that can be reconstructed, and it remains 
to be determined whether this STORM platform can be used for 
dense reconstruction of all neurons in a volume. 

With its unique capabilities complementary to existing recon- 
struction methods, we expect that this volumetric super-resolu- 
tion reconstruction platform will enable a variety of synaptic 
connectivity analyses that will substantially enhance our under- 
standing of the structural basis of nervous system function. 
The ability to reconstruct and identify endogenous molecular tar- 
gets in large tissue volumes should also benefit the studies of 
many other biological systems. 

EXPERIMENTAL PROCEDURES 
Animals 

Animal work was performed in accordance with protocols approved by the 
Institutional Animal Care and Use Committee at Harvard University. Adult 
transgenic mice (Tg(Thy-1-EGFP)MJrs/J or YFP (Tg(Thy1-YFP)HJrs/J, The 
Jackson Laboratory) (Feng et al., 2000), both male and female animals 6- 
24 weeks of age, were used in our experiments. 

Retinal Tissue Preparation 

Whole eye-cups were immersion fixed in 4% paraformaldehyde for 10-60 min 
at room temperature. Both whole-mount and vibratome-sectioned retinae 
were used for labeling. For whole-mount labeling, retinae were laid flat on nitro- 
cellulose membranes, and individual labeled neurons were excised in circular 
punches (diameter --500 |im, thickness -^200 |am). For vibratome section la- 
beling, retinae were immersed in 37°C 2%-3% agarose, cooled on ice, and 
sectioned at 50-150 ^im thickness in 1 x DPBS. 

Immunohistochemistry 

Retinae were blocked in 10% normal donkey serum in 1 x DPBS with 0.3% 
Triton X-100 and 0.02%-0.05% sodium azide for 2-3 hr at room temperature 
and incubated in primary antibody solutions diluted in blocking buffer over- 
night for 3-4 nights at 4°C. A complete list of all primary antibodies tested in 
this work is provided in Table SI with the antibodies selected for the STORM 
reconstructions highlighted. Following primary antibody incubation, retinae 
were washed 6 times for 20 min each in 2% normal donkey serum in 1 x 
DPBS at room temperature and incubated in secondary antibodies (detailed 
in the Supplemental Experimental Procedures) overnight at 4°C for 1-2 nights 
to label the neuron with photoswitchable dye Atto 488 and two synaptic tar- 
gets (gephyrin and presynaptic proteins or gephyrin and receptors) with photo- 
switchable dyes Alexa Fluor 647 and DyLight 750, respectively. The antibodies 
for labeling synaptic proteins were also conjugated to Alexa Fluor 405 to facil- 
itate photoactivation of Alexa Fluor 647 and DyLight 750. Retinae were then 
washed 6 times for 20 min each in 1 x DPBS at room temperature and incu- 
bated overnight in Cy3B-labeled WGA. 

Postfixation, Dehydration, and Embedding in Epoxy Resin 

Labeled retinae were postfixed for 2 hr in 3% paraformaldehyde and 0.1% 
glutaraldehyde diluted in 1 x DPBS. Postfixed retinae were dehydrated in a 



graded series of ethanol washes (50%/70%/90%/100% two times) for 10- 
20 min each and then incubated in UltraBed Epoxy Resin (Electron Microscopy 
Sciences) solutions of increasing concentration for 2 hr each (75% ethanol/ 
25% resin; 50% ethanol/50% resin; 25% ethanol/75% resin; 100% resin 2 
times). Dehydrated resin blocks were then polymerized in UltraBed overnight 
for 16 hr at 70°C. 

Ultrathin Sectioning 

Ultrathin sections were cut at 70 nm on a Leica UC7 Ultramicrotome (Leica 
Microsystems) using an ultra Jumbo diamond knife (Diatome). The section 
thickness was verified in two independent ways, as described in the Supple- 
mental Experimental Procedures. Sections were collected on glass coverslips 
coated with 0.5% gelatin/0.05% chromium potassium sulfate. Coverslips 
were dried at 60°C for 25 min. 

Preparation of Coversiips for Imaging 

Coverslips of tissue sections were immersed in 10% sodium ethoxide solution 
for 5-20 min to etch the embedding resin for optimal photoswitching of dyes. 
Fluorescent beads (mixture of 540/560 and 715/755 FluoSpheres from Life 
Technologies, detailed in the Supplemental Experimental Procedures) were 
spotted on the coverslips as fiducial markers. Coverslips were secured to 
glass slide flow channels, filled with STORM imaging buffer (10% glucose/ 
17.5 laM glucose oxidase/708 nM catalase/10 mM MEA/10 mM NaCI/ 
200 mM Tris), and sealed with epoxy. 

imaging Setup 

Imaging was performed through Olympus UPlanSApo 1 0Ox 1 .4 NA oil-immer- 
sion objectives mounted on Olympus 1X71 inverted microscopes with back op- 
tics arranged for oblique incident angle illumination. The microscope con- 
tained a custom pentaband dichroic and pentanotch filter (Chroma 
Technology Corp) and laser lines at 488/561/647/750 nm (detailed in the Sup- 
plemental Experimental Procedures) for excitation of Atto 488, Cy3B, Alexa 
Fluor 647, and DyLight 750, respectively. A 405 nm laser was used for reacti- 
vation of dyes. Images were acquired on an Andor IXon3 897 or 897Ultra 
EMCCD camera through a QV2 quadview image splitter (Photometries). 
Each camera pixel corresponded to -^158 nm in sample space, and the total 
imaging field size was ^40 |am x 40 |am. Axial focus during imaging was main- 
tained in an automated manner as described previously (Dempsey et al., 
2011). 

Automated Image Acquisition 

Tissue sections and fiducial bead fields were initially located using a 4x 
objective. Regions of interest (ROIs) were subsequently identified with a 
lOOx objective. The stage position coordinates for each ROI were deter- 
mined, and the position list for all ROIs on a coverslip was then used to 
generate a master file that controlled laser illumination, camera activation, 
stage movement, AOTF control, and shutter sequences for automated 
STORM and conventional imaging. Each imaging session began with imag- 
ing of low-density bead fields by first exciting the 540/560 beads at 
488 nm and detecting in the Alexa Fluor 647, Cy3B, and Atto 488 channels 
and then exciting the 715/755 beads at 752 nm and detecting in the 
DyLight 750 and Alexa Fluor 647 channels. These low-density bead images 
were used for chromatic aberration correction across different color 
channels. 

Next, each ROI was imaged at the conventional resolution in each of the 
four color channels (DyLight 750, Alexa Fluor 647, Cy3B, and Atto 488). 
Next, images of the high-density bead field were acquired in each of the 
four color channels for (1) flat-field correction to compensate for non-uniform 
illumination across the field of view and (2) lens distortion correction at image 
field edges. 

STORM imaging of individual ROIs was next performed in four color chan- 
nels. For each ROI, the DyLight 750 channel was imaged for -^4K-4.5K frames 
at 30 Hz, the Alexa Fluor 647 channel was imaged for 6K-7K frames at 60 Hz, 
and the Cy3B and Atto 488 channels were each imaged for -^10K frames at 
60 Hz. To ensure that overlapping regions in each montage were not bleached, 
STORM movies were collected in two passes for each ROI, each consisting of 
half the total number of frames described above. 
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STORM Image Analysis 

STORM movies were analyzed to determine the positions of individual mole- 
cules using a DAOSTORM algorithm (Babcock et al., 2012; Holden et al., 
2011). Molecule lists were rendered as 2D images with 15.8 nm pixel size, 
which is close to both our ~20 nm STORM image resolution and 1/10 of the 
camera pixel size. For consistency of analysis, the conventional images 
were up-sampled to 15.8 nm/pixel. Chromatic aberrations were corrected 
using the transformation maps generated from the low-density bead field im- 
ages, and lens-induced optical distortions were corrected using transforma- 
tion maps generated from the high-density bead field images, as detailed in 
the Supplemental Experimental Procedures. 

Alignment of Multiple Image Tiles within Individual Sections 

Each STORM image was aligned to the corresponding conventional image 
using two-dimensional cross-correlation (Guizar-Sicairos et al., 2008). For 
mosaic imaging, Scale-Invariant Feature Transformation (SIFT) (Lowe, 
2004) was used to find points of similarity between overlapping regions in 
adjacent image tiles in the WGA channel and generate a rigid alignment 
transformation that was applied to the conventional and STORM images 
to stitch overlapping image tiles. On average, the residual offset in align- 
ment between SIFT points of similarity in two adjacent image tiles was 
<40 nm. 

Alignment of Serial Sections 

Corresponding SIFT features between adjacent sections were used to deter- 
mine a rigid linear transformation between sections, which was applied to all 
sections in the dataset to achieve a coarse, 3D rigid alignment of the data. 
Then, we applied elastic registration (Saalfeld et al., 2012) to further improve 
the alignment accuracy between adjacent sections while minimizing the global 
deformation of the entire image block. The warping transforms generated in 
these steps were applied to all conventional fluorescence and STORM 
channels. 

Segmentation of STORM and Conventional Fluorescence Images 

STORM images were first filtered using a mask generated from the conven- 
tional images to remove background and signals from occasional debris on 
the coverslip. To generate this mask, the signals in the conventional images 
were thresholded using the lower threshold of a two-level Otsu threshold 
method (Otsu, 1979) that divided the signals in our images into three classes 
with the lowest-intensity class representing the background, the highest in- 
tensity class representing neuronal and synaptic features, and the middle 
class representing other low-intensity signals above background. To identify 
the surface of the neuron, we smoothed the neuron signal with a Gaussian 
kernel with a = 47 nm and then binarized the neuron signal using the lower 
threshold of the two-level Otsu threshold method. To identify fluorescent 
clusters in the gephyrin, presynaptic or receptor channels in the STORM im- 
ages, we applied a 79 nm Gaussian convolution to the signal in the XY plane 
and an isometric Gaussian convolution (-^1 voxel) in Z and used the lower 
threshold of the two-level Otsu threshold method to binarize the image 
and identify connected components in three dimensions. Additional separa- 
tion of over-connected clusters was performed using a watershed transfor- 
mation. Processing of conventional images was performed similarly, except 
that we binarized the conventional images based on the higher threshold of a 
two-level Otsu threshold. 

Two-Dimensional Analysis to Separate Different Populations 
of Gephyrin and Presynaptic Clusters 

To determine whether a given cluster was synaptic, two parameters were 
considered for each cluster in the gephyrin and presynaptic channels: the vol- 
ume of the cluster was calculated from the connected components within the 
segmented image. Second, the signal density was measured as the fraction of 
volume of the connected components that was occupied by signal-positive 
voxels in the raw data. For STORM images, plotting the distribution of these 
two parameters constructed from all clusters in the dataset as a 2D histogram 
showed two peaks. Separation of the two populations is described in the Sup- 
plemental Experimental Procedures. 



Ripley’s K Function 

The Ripley’s K function is calculated as K{t) = < 0 /n, where t is the 

distance along neurites, X is the average density of synapses on the neuron 
skeleton, / is the indicator function, c/,y is the distance between the and 
synapses, and n is the number of synapses on the neuron, u is the average 
of /C(f) derived from 1 ,000 randomizations of synapse positions on the surface 
of the dendritic arbor. 

A detailed complete description of the experimental procedures can be 
found in the Supplemental Experimental Procedures accompanying this 
paper. 

SUPPLEMENTAL INFORMATION 

Supplemental Information includes Supplemental Experimental Procedures, 
seven figures, one table, and four movies and can be found with this article on- 
line at http://dx.doi.Org/10.1016/j.cell.2015.08.033. 
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SUMMARY 

Invasive lobular carcinoma (ILC) is the second most 
prevalent histologic subtype of invasive breast can- 
cer. Here, we comprehensively profiled 817 breast 
tumors, including 127 ILC, 490 ductal (IDC), and 88 
mixed IDC/ILC. Besides E-cadherin loss, the best 
known ILC genetic hallmark, we identified mutations 
targeting PTEN, TBX3, and FOXA1 as ILC enriched 
features. PTEN loss associated with increased AKT 
phosphorylation, which was highest in ILC among 
all breast cancer subtypes. Spatially clustered 
FOXA1 mutations correlated with increased FOXA1 
expression and activity. Conversely, GATA3 muta- 
tions and high expression characterized luminal A 
IDC, suggesting differential modulation of ER activity 
in ILC and IDC. Proliferation and immune-related 
signatures determined three ILC transcriptional 
subtypes associated with survival differences. Mixed 
IDC/ILC cases were molecularly classified as ILC-like 
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and I DC-like revealing no true hybrid features. This 
multidimensional molecular atlas sheds new light 
on the genetic bases of ILC and provides potential 
clinical options. 

INTRODUCTION 

Invasive lobular carcinoma (ILC) is the second most frequently 
diagnosed histologic subtype of invasive breast cancer, consti- 
tuting ~10%-15% of all cases. The classical form (Foote and 
Stewart, 1946) is characterized by small discohesive neoplastic 
cells invading the stroma in a single-file pattern. The discohesive 
phenotype is due to dysregulation of cell-cell adhesion, primarily 
driven by lack of E-cadherin (CDH1) protein expression 
observed in ~90% of ILCs (McCart Reed et al., 2015; Morrogh 
et al., 2012). This feature is the ILC hallmark, and immuno- 
histochemistry (IHC) scoring for CDH1 expression is often 
used to discriminate between lesions with borderline ductal 
versus lobular histological features. ILC variants have also 
been described, yet all display loss of E-cadherin expression 
(Dabbs et al.,2013). 
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Figure 1. Moiecuiar Determinants of Invasive Lobular Breast Cancer 

(A) Histopathological breast cancer subtypes: invasive ductai (IDC), invasive lobular (ILC), mixed ductal/lobular (Mixed), and other-type (Other) carcinoma. 
PAM50 intrinsic subtypes are not equally distributed across breast cancer subtypes. 

(B) Recurrently mutated genes (MutSigCV2) in ILC. 

(C) Comparison of the alteration frequency for 153 recurrent genomic alterations in ILC versus IDC. 

(D) Comparison of the alteration frequency for 153 recurrent genomic alterations in ILC LumA versus IDC LumA. 



Classic ILCs are typically of low histologic grade and low to 
intermediate mitotic index. They express estrogen and proges- 
terone receptors (ER and PR) and rarely show HER2 protein 
overexpression or amplification. These features are generally 
associated with a good prognosis, yet some studies suggest 
that long-term outcomes of ILC are inferior to stage-matched 
invasive ductal carcinoma (IDC) (Pestalozzi et al., 2008). Impor- 
tantly, ILC infiltrative growth pattern complicates both physical 
exam and mammographic findings and its patterns of metastatic 
spread often differ from those of IDC (Arpino et al., 2004). 

To date, genomic studies of ILC have provided limited insight 
into the biologic underpinnings of this disease, mostly focusing 
on mRNA expression and DNA copy-number analysis (McCart 
Reed et al., 2015). The first TCGA breast cancer study (Cancer 
Genome Atlas, 2012) reported on 466 breast tumors assayed 
on six different technology platforms. ILC was represented by 
only 36 samples, and no lobular-specific features were noted 
besides mutations and decreased mRNA and protein expression 
of CDH1 . Here, we analyzed nearly twice as many breast tumors 



from TCGA (n = 817), including 127 ILC. This study identified 
multiple genomic alterations that discriminate between ILC and 
IDC demonstrating at the molecular level that ILC is a distinct 
breast cancer subtype and providing new insight into ILC tumor 
biology and therapeutic options. 

RESULTS 

Genetic Determinants of Invasive Lobular Cancer 

A total of 817 breast tumor samples were profiled with five 
different platforms as previously described (Cancer Genome 
Atlas Research Network, 201 4) and 633 cases were also profiled 
by reverse-phase protein array (RPPA). A pathology committee 
reviewed and classified all tumors into 490 IDC, 127 ILC, 88 
cases with mixed IDC and ILC features, and 112 with other 
histologies (Table SI). As expected, lobular tumors were pre- 
dominantly classified as luminal A (LumA) (Figure 1 A) and being 
typically ER-i- tumors characterized by low levels of proliferation 
markers (Table SI). ER status was clinically determined by 
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immunohistochemistry on 120 of 127 ILC cases, with 94% (n = 
113) scoring positively. 

Within 127 ILC, we identified 8,173 total coding mutations, 
integrating information from both DNA and RNA sequencing 
(Wilkerson et al., 2014). Recurrently mutated genes in ILC were 
identified by MutSigCV2 (Lawrence et al., 2013) and included 
many genes previously implicated in breast cancer (Figure 1 B, 
Table 1) (Cancer Genome Atlas, 2012). Similarly, recurrent 
copy-number alterations in ILC estimated by GISTIC (Mermel 
et al., 201 1) recapitulated known breast cancer gains and losses, 
in particular those observed in ER+/luminal tumors (Figure SI A). 
However, the frequency of these alterations (both mutations and 
copy-number changes) often differed significantly between IDC 
and ILC. 

To investigate these differences, we identified recurrent alter- 
ations across all 817 samples and separately in ILC and IDC 
PAM50 subtypes (luminal A, n = 201, luminal B, n = 122, 
HER2-enriched, n = 51 , and basal-like, n = 1 07). In total, we iden- 
tified 1 78 events, including 68 mutated genes, 47 regions of gain, 
and 63 regions of loss (Table 1 and Table S2). Several of these 
had different incidence in ILC and in IDC (Figure 1C, Table S3). 
ILC cases were significantly enriched for CDH1 mutations 
(63% in ILC versus 2% in IDC, q = 3.94E-53), most of them trun- 
cating, and mutations affecting TBX3 (9% versus 2%, q = 0.003), 
RUNX1 (10% versus 3%, q = 0.008), PIK3CA (48% versus 33%, 
q = 0.02), and FOXA1 (7% versus 2%, q = 0.08). By contrast, 
alterations typically observed in ER-/basal-like tumors were 
less frequent in ILC, including TP53 mutations (8% in ILC versus 
44% in IDC, q = 1.9E-14) and focal amplification of MYC 
(6% versus 27%, q = 7.42E-7) and CCNE1 (0% versus 7%, q = 
0.01). These results partly reflect genetic differences between 
ER+/luminal and ER-/basal-like breast cancer, given that ILC 
tumors were predominantly LumA. Nonetheless, unexpected 
differences did emerge including a lower incidence of GATA3 
mutations in ILC compared to IDC (5% in ILC versus 13% in 
IDC, q = 0.03) (Figure 1C). 

To better identify ILC discriminatory features, we limited our 
analyses to LumA samples, representing 41% of IDC (n = 201) 
and 83% of ILC (n = 106) (Figure ID). This analysis confirmed a 
high incidence of CDH1 (q = 1.4E-30), TBX3 (q = 0.05), and 
FOXA1 (q = 0.065) mutations in ILC, while the frequency of 
RUNX1 and PIK3CA mutations was no longer significantly 
different. GATA3 mutations (5% ILC versus 20% IDC, q = 
0.003) were the second most discriminant event after CDFI1 mu- 
tations, mostly affecting IDC tumors. Interestingly, both FOXA1 
and GATA3 are key regulators of ER activity (Liu et al., 2014), 
suggesting IDC and ILC may preferentially rely on different 
mechanisms to mediate the ER transcriptional program. Finally, 
homozygous losses of the PTEN locus (10q23) were more 
frequent in ILC (q = 0.035) as were PTEN mutations (8% versus 
3%). Collectively, PTEN inactivating alterations were identified 
in 14% of LumA ILC versus 3% of LumA IDC (p = 9E-4), making 
this the third most discriminant feature between LumA IDC and 
LumA ILC (Figure ID). 

E-Cadherin Loss in Invasive Lobular Carcinoma 

Loss of the epithelial specific cell-cell adhesion molecule E-cad- 
herin (CDH1) is the key hallmark of ILC (Dabbs et al., 2013; 



Moll et al., 1993). CDH1 loss is believed to confer the highly dis- 
cohesive morphology characteristic of this tumor subtype and is 
often associated with tumor invasion and metastasis in other tu- 
mor types, including diffuse gastric cancer (Brinck et al., 2004; 
Cancer Genome Atlas Research Network, 2014; Richards 
et al., 1999). Loss-of-function mutations targeting CDH1 are pre- 
sent in 50%-60% of ILC and are believed to be an early event 
often observed in matching lobular carcinoma in situ (LCIS) 
(McCart Reed et al., 2015). CDH1 mutations typically occur in 
combination with chromosome 16q loss, where CDH1 is 
located, thus inducing complete loss of the protein. 

We identified 108 mutations in the coding sequence of CDH1 
in 107/817 patients (63%); 80 of these occurred in ILC cases. 
These mutations were rather uniformly distributed along the 
coding sequence, and 83% of them were predicted to be 
truncating (Figure 2A). CDH1 mutations almost invariably co- 
occurred with heterozygous loss of 16q (affecting 89% of 
ILC cases) and were associated with downregulation of both 
CDH1 transcript and protein levels (Figures 2B and S2A). 
By combining somatic mutations, copy-number losses, and 
mRNA and low protein expression (the latter when available), 
we identified E-cadherin alterations in 120/127 (95%) cases 
with DNA and RNA data, and in all 79 cases with DNA, RNA, 
and protein data (Figures S2A-SC). 

Previous studies reported sporadic cases of multiple cancer 
types with high DNA methylation levels at the CDH1 promoter, 
suggesting epigenetic silencing as an alternative mechanism 
for downregulation of CDH1 (Graff et al., 1997; Richards et al., 
1999; Sarrio et al., 2003; Zou et al., 2009). We analyzed the 
DNA methylation levels in breast tumors at CpG sites spanning 
from upstream of the CDH1 promoter, across the promoter 
CpG island, and extending into the first intron (Figure S2D). 
Despite four of these probes matching DNA positions previously 
reported as methylated in ILC (Graff et al., 1997; Sarrio et al., 
2003; Zou et al., 2009), we did not detect significant DNA hy- 
per-methylation at these probes (Figure S2E), nor in any of the 
other CDH1 associated probes analyzed (Figures 2B and S2F 
and S2G). Moderately increased methylation was observed in 
a few cases near exon 2; however, DNA methylation at this site 
correlated with lower tumor purity and increased leukocyte infil- 
tration and indeed it mimicked methylation levels at this CpG site 
in normal leukocytes (Figures S2F and S2G). Infinium DNA 
methylation results were validated by whole-genome bisulfite 
sequencing in five samples (Figure S2H). Altogether, these 
data confirm that CDH1 expression was substantially lower in 
ILC than in IDC and that this expression difference did not 
appear associated with DNA methylation at the CDH1 promoter. 
Our results on 81 7 invasive breast tumors thus confirmed E-cad- 
herin loss as ILC defining molecular feature but do not support 
the reported occurrence of CDH1 epigenetic silencing in invasive 
breast cancer. The discrepancy with prior literature may be 
attributable in part to the reliance on highly sensitive, but non- 
quantitative, methylation-specific PCR assays in past studies 
(Herman et al., 1996) and will require further investigation. 

FOXA1 Mutations in Breast Cancer 

FOXA1 is a key ER transcriptional modulator (Carroll et al., 2005; 
Hurtado et al., 2011) coordinating ER DNA binding within a large 
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Table 1. Recurrently Mutated Genes in Breast Cancer 



Gene 


ILC (n = 127) 
n q 


ILC Luminal A 
(106) 
n q 


IDC (490) 
n q 


IDC Luminal A 
(201) 
n q 


IDC Luminal B 
(122) 
n q 


IDC Her2- 
enriched (51) 
n q 


IDC Basal- 
like (107) 
n q 


ALL Breast 
Cancer (81 7) 
n q 


PIK3CA 


61 


1.02E-12 


54 


9.18E-13 


164 


6.09E-13 


93 


6.79E-13 


43 


6.76E-13 


19 


9.14E-13 


7 


6.22E-02 


282 


2.54E-13 


RUNX1 


13 


1.02E-12 


9 


9.18E-13 


13 


n.s. 


9 


1.32E-05 


3 


n.s. 


0 


n.s. 


1 


n.s. 


32 


2.54E-13 


CDH1 


80 


3.40E-12 


68 


6.12E-12 


10 


7.63E-03 


7 


5.33E-02 


2 


n.s. 


0 


n.s. 


0 


n.s. 


107 


2.54E-13 


TP53 


10 


8.26E-11 


6 


2.22E-04 


215 


6.09E-13 


28 


6.79E-13 


52 


6.76E-13 


37 


9.14E-13 


92 


1.83E-12 


280 


2.54E-13 


TBX3 


12 


2.54E-08 


10 


4.01 E-06 


8 


n.s. 


5 


n.s. 


2 


n.s. 


0 


n.s. 


1 


n.s. 


26 


1.1 IE-08 


PTEN 


9 


8.43E-08 


8 


8.86E-09 


27 


5.61 E-11 


6 


5.63E-03 


11 


9.64E-12 


4 


3.21 E-02 


6 


4.65E-03 


42 


2.54E-13 


FOXA1 


9 


5.52E-04 


9 


6.53E-04 


11 


n.s. 


5 


n.s. 


3 


n.s. 


2 


n.s. 


1 


n.s. 


30 


4.52E-13 


MAP3K1 


7 


2.95E-02 


6 


7.54E-02 


40 


4.06E-12 


33 


1.25E-11 


2 


n.s. 


1 


n.s. 


4 


n.s. 


69 


2.54E-13 


GATA3 


6 


n.s. 


5 


n.s. 


66 


6.09E-13 


40 


6.79E-13 


22 


6.76E-13 


3 


n.s. 


0 


n.s. 


96 


2.54E-13 


AKT1 


3 


n.s. 


3 


n.s. 


15 


5.61 E-11 


11 


1.25E-11 


3 


5.91 E-02 


1 


n.s. 


0 


n.s. 


20 


2.54E-13 


NBL1 


3 


n.s. 


2 


n.s. 


10 


1.08E-10 


8 


2.04E-12 


0 


n.s. 


1 


n.s. 


1 


n.s. 


16 


5.24E-11 


KMT2C 


9 


n.s. 


8 


n.s. 


37 


1 .49E-08 


17 


2.94E-02 


12 


n.s. 


3 


n.s. 


5 


n.s. 


64 


4.89E-06 


DCTD 


0 


n.s. 


0 


n.s. 


6 


1.02E-05 


3 


1 .54E-02 


1 


n.s. 


1 


n.s. 


0 


n.s. 


6 


7.61 E-04 


RB1 


0 


n.s. 


0 


n.s. 


16 


1 .46E-04 


4 


n.s. 


7 


n.s. 


1 


n.s. 


4 


n.s. 


18 


n.s. 


SF3B1 


4 


n.s. 


4 


n.s. 


12 


3.20E-04 


6 


1.12E-03 


4 


n.s. 


1 


n.s. 


1 


n.s. 


16 


3.68E-04 


CBFB 


2 


n.s. 


2 


n.s. 


15 


1.51 E-03 


13 


5.68E-06 


1 


n.s. 


1 


n.s. 


0 


n.s. 


24 


8.14E-13 


ARHGAP35 


1 


n.s. 


1 


n.s. 


13 


1.62E-03 


5 


n.s. 


5 


1 .24E-02 


0 


n.s. 


3 


n.s. 


18 


7.74E-03 


OR9A2 


0 


n.s. 


0 


n.s. 


5 


1.77E-03 


2 


n.s. 


1 


n.s. 


1 


n.s. 


1 


n.s. 


5 


6.47E-03 


NCOA3 


6 


n.s. 


4 


n.s. 


24 


1.77E-03 


7 


n.s. 


7 


n.s. 


4 


n.s. 


6 


n.s. 


40 


3.25E-07 


RBMX 


2 


n.s. 


2 


n.s. 


10 


2.83E-03 


3 


n.s. 


2 


n.s. 


1 


n.s. 


4 


6.27E-02 


12 


4.01 E-08 


MAP2K4 


2 


n.s. 


2 


n.s. 


24 


2.83E-03 


18 


5.29E-12 


5 


n.s. 


1 


n.s. 


0 


n.s. 


30 


1.37E-05 


TROVE2 


0 


n.s. 


0 


n.s. 


6 


4.51 E-03 


1 


n.s. 


2 


n.s. 


1 


n.s. 


2 


n.s. 


8 


2.77E-03 


NADK 


0 


n.s. 


0 


n.s. 


4 


4.51 E-03 


0 


n.s. 


4 


5.85E-05 


0 


n.s. 


0 


n.s. 


6 


3.61 E-03 


CASP8 


1 


n.s. 


1 


n.s. 


9 


6.00E-03 


2 


n.s. 


3 


n.s. 


0 


n.s. 


3 


n.s. 


11 


1.81 E-03 


CTSS 


0 


n.s. 


0 


n.s. 


5 


6.00E-03 


1 


n.s. 


2 


n.s. 


0 


n.s. 


2 


n.s. 


5 


8.91 E-02 


ACTL6B 


2 


n.s. 


1 


n.s. 


5 


7.63E-03 


2 


n.s. 


1 


n.s. 


0 


n.s. 


2 


n.s. 


10 


7.33E-05 


LGALS1 


0 


n.s. 


0 


n.s. 


4 


9.78E-03 


2 


n.s. 


2 


n.s. 


0 


n.s. 


0 


n.s. 


5 


6.34E-03 


KRAS 


2 


n.s. 


1 


n.s. 


4 


1 .54E-02 


3 


7.32E-03 


0 


n.s. 


0 


n.s. 


1 


n.s. 


7 


2.49E-04 


KCNN3 


2 


n.s. 


2 


n.s. 


8 


1.81 E-02 


1 


n.s. 


2 


n.s. 


2 


n.s. 


3 


2.45E-02 


16 


4.53E-02 


FBXW7 


2 


n.s. 


2 


n.s. 


6 


2.19E-02 


0 


n.s. 


0 


n.s. 


0 


n.s. 


6 


8.28E-04 


11 


n.s. 


LRIG2 


0 


n.s. 


0 


n.s. 


4 


3.08E-02 


2 


n.s. 


0 


n.s. 


1 


n.s. 


1 


n.s. 


6 


n.s. 


PIK3R1 


0 


n.s. 


0 


n.s. 


9 


3.08E-02 


2 


n.s. 


3 


n.s. 


2 


n.s. 


2 


n.s. 


13 


1 .56E-03 


PARP4 


3 


n.s. 


3 


n.s. 


7 


3.08E-02 


3 


n.s. 


4 


n.s. 


0 


n.s. 


0 


n.s. 


12 


n.s. 


ZNF28 


3 


n.s. 


3 


n.s. 


7 


3.25E-02 


1 


n.s. 


5 


n.s. 


0 


n.s. 


1 


n.s. 


11 


1.72 E-02 


HLA-DRB1 


0 


n.s. 


0 


n.s. 


13 


3.52E-02 


9 


1.49E-02 


2 


n.s. 


0 


n.s. 


2 


n.s. 


16 


n.s. 


ERBB2 


5 


n.s. 


4 


n.s. 


7 


6.42E-02 


3 


n.s. 


1 


n.s. 


2 


n.s. 


1 


n.s. 


18 


3.36E-06 


ZMYM3 


0 


n.s. 


0 


n.s. 


9 


8.83E-02 


3 


n.s. 


1 


n.s. 


1 


n.s. 


4 


n.s. 


11 


n.s. 


RAB42 


1 


n.s. 


1 


n.s. 


2 


n.s. 


0 


n.s. 


0 


n.s. 


0 


n.s. 


2 


6.27E-02 


4 


1.82E-03 


CTCF 


0 


n.s. 


0 


n.s. 


12 


n.s. 


9 


7.05E-08 


1 


n.s. 


1 


n.s. 


1 


n.s. 


18 


1 .93E-03 


ATAD2 


0 


n.s. 


0 


n.s. 


9 


n.s. 


2 


n.s. 


4 


7.32E-02 


2 


n.s. 


1 


n.s. 


12 


n.s. 


CDKN1B 


3 


n.s. 


2 


n.s. 


5 


n.s. 


4 


9.59E-02 


1 


n.s. 


0 


n.s. 


0 


n.s. 


11 


1.14E-03 


GRIA2 


0 


n.s. 


0 


n.s. 


6 


n.s. 


5 


5.33E-02 


0 


n.s. 


0 


n.s. 


1 


n.s. 


6 


n.s. 


NCOR1 


8 


n.s. 


8 


n.s. 


23 


n.s. 


12 


4.81 E-03 


7 


n.s. 


1 


n.s. 


3 


n.s. 


39 


3.61 E-03 


HRNR 


4 


n.s. 


4 


n.s. 


13 


n.s. 


3 


n.s. 


3 


n.s. 


3 


n.s. 


4 


n.s. 


23 


7.65E-02 


GPRIN2 


1 


n.s. 


1 


n.s. 


6 


n.s. 


3 


n.s. 


1 


n.s. 


0 


n.s. 


2 


n.s. 


11 


1.16E-05 


PAX2 


1 


n.s. 


0 


n.s. 


2 


n.s. 


2 


n.s. 


0 


n.s. 


0 


n.s. 


0 


n.s. 


4 


4.80E-02 



{Continued on next page) 
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Gene 


ILC (n = 127) 
n q 


ILC Luminal A 
(106) 
n q 


IDC (490) 
n q 


IDC Luminal A 
(201) 
n q 


IDC Luminal B 
(122) 
n q 


IDC Her2- 
enriched (51) 
n q 


IDC Basal- 
like (107) 
n q 


ALL Breast 
Cancer (81 7) 
n q 


ACTG1 


1 


n.s. 


1 


n.s. 


4 


n.s. 


2 


n.s. 


0 


n.s. 


0 


n.s. 


2 


n.s. 


8 


9.39E-02 


AQP12A 


0 


n.s. 


0 


n.s. 


3 


n.s. 


1 


n.s. 


1 


n.s. 


0 


n.s. 


1 


n.s. 


5 


2.69E-02 


PIK3C3 


2 


n.s. 


2 


n.s. 


5 


n.s. 


2 


n.s. 


0 


n.s. 


1 


n.s. 


2 


n.s. 


11 


3.23E-02 


MYB 


1 


n.s. 


1 


n.s. 


7 


n.s. 


3 


n.s. 


2 


n.s. 


0 


n.s. 


2 


n.s. 


12 


8.91 E-02 


IRS4 


1 


n.s. 


1 


n.s. 


6 


n.s. 


3 


n.s. 


0 


n.s. 


0 


n.s. 


3 


n.s. 


8 


9.38E-02 


TBL1XR1 


3 


n.s. 


2 


n.s. 


3 


n.s. 


1 


n.s. 


1 


n.s. 


1 


n.s. 


0 


n.s. 


12 


4.71 E-04 


RPGR 


4 


n.s. 


3 


n.s. 


11 


n.s. 


3 


n.s. 


3 


n.s. 


2 


n.s. 


3 


n.s. 


19 


1 .26E-03 


CCNI 


1 


n.s. 


1 


n.s. 


2 


n.s. 


0 


n.s. 


2 


n.s. 


0 


n.s. 


0 


n.s. 


3 


6.93E-02 


ARID1A 


7 


n.s. 


5 


n.s. 


16 


n.s. 


7 


n.s. 


4 


n.s. 


3 


n.s. 


2 


n.s. 


33 


7.91 E-09 


CD3EAP 


1 


n.s. 


0 


n.s. 


2 


n.s. 


0 


n.s. 


0 


n.s. 


0 


n.s. 


2 


n.s. 


5 


1 .29E-02 


ADAMTS6 


1 


n.s. 


1 


n.s. 


3 


n.s. 


1 


n.s. 


0 


n.s. 


0 


n.s. 


2 


n.s. 


8 


1.81E-03 


0R2D2 


0 


n.s. 


0 


n.s. 


4 


n.s. 


0 


n.s. 


3 


n.s. 


0 


n.s. 


1 


n.s. 


5 


5.67E-02 


TMEM199 


0 


n.s. 


0 


n.s. 


3 


n.s. 


0 


n.s. 


2 


n.s. 


1 


n.s. 


0 


n.s. 


4 


3.36E-02 


MST1 


0 


n.s. 


0 


n.s. 


5 


n.s. 


2 


n.s. 


2 


n.s. 


0 


n.s. 


1 


n.s. 


7 


9.46E-02 


RHBG 


0 


n.s. 


0 


n.s. 


3 


n.s. 


0 


n.s. 


0 


n.s. 


1 


n.s. 


2 


n.s. 


4 


7.91 E-02 


ZFP36L1 


1 


n.s. 


1 


n.s. 


5 


n.s. 


2 


n.s. 


2 


n.s. 


0 


n.s. 


1 


n.s. 


8 


3.37E-02 


TCP11 


2 


n.s. 


0 


n.s. 


3 


n.s. 


2 


n.s. 


0 


n.s. 


0 


n.s. 


1 


n.s. 


6 


4.80E-02 


CASZ1 


4 


n.s. 


4 


n.s. 


3 


n.s. 


0 


n.s. 


0 


n.s. 


1 


n.s. 


2 


n.s. 


11 


2.03E-02 


GAL3ST1 


1 


n.s. 


1 


n.s. 


2 


n.s. 


0 


n.s. 


1 


n.s. 


0 


n.s. 


1 


n.s. 


4 


7.74E-03 


FRMPD2 


1 


n.s. 


1 


n.s. 


7 


n.s. 


2 


n.s. 


4 


n.s. 


0 


n.s. 


1 


n.s. 


9 


8.91 E-02 


GPS2 


1 


n.s. 


1 


n.s. 


4 


n.s. 


3 


n.s. 


0 


n.s. 


1 


n.s. 


0 


n.s. 


8 


8.91 E-02 


ZNF362 


0 


n.s. 


0 


n.s. 


3 


n.s. 


3 


n.s. 


0 


n.s. 


0 


n.s. 


0 


n.s. 


3 


8.91 E-02 



n: number of mutations q: MutSigCV2 q value. 



protein complex by modifying chromatin accessibility and medi- 
ating long-range DNA interactions (Liu et al., 2014). High FOXA1 
expression has been previously reported in breast and prostate 
cancer (Habashy et al., 2008; Sahu et al., 2011) and somatic 
mutations in the FOXA1 gene have been reported in these tumor 
types in about 3%-4% of the cases (Barbieri et al., 201 2; Cancer 
Genome Atlas, 2012; Robinson et al., 2015). 

Here, we observed a total of 33 FOXA1 mutations in 30/817 
(3.7%) tumors (Figure 3A), and the large sample set allowed us 
to identify regional hotspots in the FOXA1 mutation distribution. 
Mutations clustered in the fork-head DNA binding (FK) and 
C terminus transactivation domains (Figure 3A). A similar muta- 
tional pattern was observed by combining multiple prostate 
cancer sequencing studies (Baca et al., 2013; Barbieri et al., 
2012; Grasso et al., 2012; Robinson et al., 2015) (Figure S3A), 
and confirmed by the TCGA prostate cancer project (Robinson 
et al., 2015). Thus, regional FOXA1 mutation hotspots are selec- 
tively altered in a tissue-independent fashion. 

Eleven FOXA1 mutations were observed in 9/127 (7%) ILC 
cases. All FOXA1 mutations in ILC were in the FK domain, 
whereas mutations in I DC (n = 11) were observed both in FK 
(n = 6) and other structural elements (n = 5), without a specific 
preference. The FK domain includes three a helices (HI, H2, 
H3), three p strands (SI , S2, S3) and two loops, typically referred 
to as “wings” (W1 , W2) (Figures 3B and S3B). FOXA1 mutations 



in FK clustered prevalently in the W2 loop. Notable exceptions 
were recurrently mutated residues II 76 (n = 4) and D226 
(n = 3). These residues are far from W2 in sequence space and 
located in different secondary structure elements; however, 
they are close (within 5 to 1 0 A) to residues in W2 in the 3D space 
(Figures 3C, 3D and S3C). In total, 22 out of 25 FK-mutations in 
our dataset fall into a restricted 3D space or “mutation structural 
hotspot” (MSH) (Figure 3D) indicating a selective pressure for 
targeting protein interactions and functions mediated by this 
region. Notably, 8/127 ILC cases have FOXA1 mutations within 
this MSH compared to 4/490 IDC cases (p = 6E-4), further sup- 
porting FOXA1 selected mutations as an ILC feature. 

FOXA1 DNA binding occurs mostly through helix H3, that rec- 
ognizes the binding motif and is stabilized by interactions medi- 
ated by its “wings” (Cirillo and Zaret, 2007; Gajiwala and Burley, 
2000; Kohler and Cirillo, 2010). Only a few residues predicted 
or experimentally shown to interact with the DNA were mutated 
(Figures 3B-3E) suggesting that these events are unlikely to 
affect FOXA1 DNA binding. FOXA1 is a pioneer factor that binds 
condensed chromatin and triggers DNA demethylation of its 
binding sites, making them accessible to transcription factors 
such as ER (Cirillo et al., 2002; Serandour et al., 2011). FOXA1 
activity can therefore be estimated by the methylation status of 
these sites where occupied FOXA1 DNA binding sites tend to 
be demethylated. We analyzed DNA methylation levels of the 
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Figure 2. E-Cadherin Loss in ILC 

(A) Mutations targeting the CDH1 gene target res- 
idues across the whoie-sequence and are mostiy 
predicted to be truncating (red). 

(B) Comparison of E-cadherin status between iLC 
and iDC reveais frequent hemizygous copy-num- 
ber iosses at the CDH1 iocus and downreguiation 
of both mRNA [iog 2 (RSEM)] and protein ieveis. 
See aiso Figures S2A-S2C. Average DNA methyi- 
ation ievei of 6 probes at the CDH1 promoter 
shows no change in DNA methyiation in both iLC 
and iDC sampies. See aiso Figures S2D-S2i. 
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3,976 most variable methyiation probes mapping to FOXA1 
binding sites (Table S3) and methylated in normal samples 
(Ross-Innes et al., 2012; Wang et al., 2012). DNA methyiation 
of these sites was substantially lower when FOXA1 and ESR1 
were highly expressed, while it remained high in FOXA 7 -negative 
cases and adjacent normal tissue (Figure 3F). Inverse correlation 
with FOXA1 expression (Pearson’s coefficient p = -0.54) was 
specifically observed for DNA methyiation at FOXA1 binding 
sites. Indeed, no correlation was found with methyiation at 
2,000 most variable probes with the same methyiation level in 
normal samples as FOXA1 binding sites (p = 0.07) (Figure 3F). 
These data support the hypothesis that FOXA1 mRNA expres- 
sion correlates with its activity. FOXA1 mutations were positively 
associated with its mRNA expression (p = 0.002) and maintained 
a similar anti-correlation with DNA methyiation at FOXA1 binding 
sites (Figure 3F). Finally, by examining mRNA expression of 
FOXA1 targets, defined as genes with a FOXA1 binding motif 
in the promoter or matching the genomic loci covered by the 
3,976 methyiation probes we analyzed (Table S3), no significant 
differences were identified and only a few genes showed moder- 
ate expression changes (Table S4). These data collectively indi- 
cate that FOXA1 mutations do not abolish protein function and, 
in fact, they may activate alternative mechanisms to affect ER 
transcriptional programs. 

Differential expression analyses between FOXA1 mutant and 
wild-type cases within distinct subsets of samples found consis- 
tent upregulation of neuroendocrine secretory proteins SCG1 
(CHGB) and SCG2, chemokine-like factor CMTM8, neuroendo- 
crine tumor associated transcription factor NKX2-2 and Kalli- 
krein serine proteases KLK12, KLK13, and KLK14 (Figure S3D 
and Table S5). While the relatively low number of FOXA1 muta- 
tions and breast cancer heterogeneity prevented the identifica- 
tion of strong transcriptional signals associated with FOXA1 mu- 
tations, several upregulated targets in FOXA1 mutant cases with 



DNA methyiation 



High 



Normals 

part of them consistently found as signifi- 
cant suggests these lesions might drive 
novel binding events. 

Interestingly, while ILC cases were en- 
riched for FOXA 7 mutations, and in partic- 
ular for those targeting the FK domain, 
ILC showed significantly fewer GATA3 
mutations, another key ER modulator. 
Mutations in GATA3 were more frequent 
in LumA IDC (Figure ID) and mutually 
exclusive with FOXA1 events. Moreover, LumA ILC tumors 
show lower GATA3 mRNA (p = 0.007) and protein (p = 2E-4) 
levels than LumA IDC (Figures S3E and S3F). Taken together, 
the differential expression patterns and enrichment for hotspot 
mutations of GATA3 in IDC and of FOXA1 in ILC, suggest a pref- 
erential requirement for distinct ER modulators in ILC and IDC. 

Akt Signaling Is Strongly Activated in ILC 

PTEN inactivation emerged as a discriminant feature between 
luminal A ILC and luminal A IDC. PTEN genetic alterations 
across all ILC cases included homozygous deletions (6%) and 
somatic mutations (7%), and were largely mutually exclusive 
with PIK3CA mutations (48%) (Figure S4A). 

Unbiased differential protein expression analysis (Table S6) 
based on RPPA data revealed significant lower PTEN protein 
expression (p = 4E-4) in LumA ILC compared to LumA IDC (Fig- 
ure 4A). Consistent with PTEN function as a negative regulator 
of Akt activity (Cantley and Neel, 1999; Song et al., 2012), ILC 
tumors also showed significantly increased Akt phosphorylation 
at both S473 (p = 0.004) and T308 (p = 7E-5) (Figure 4A). Up- 
stream of the Akt pathway, we found significant upregulation 
of total EGFR (p = IE-4) and phospho-EGFR at Y1068 (p = 
0.005) and Y1173 (p = 0.007), as well as phospho-STAT3 at 
Y705 (p = 7E-4), supporting upregulation of signaling axes 
converging on Akt activation (Wu et al., 2013). We also identified 
increased phospho-p27 at T157 (p = 0.002), an Akt substrate, 
and phospho-p70S6 kinase at T389 (p = IE-4), a direct mTOR 
target. Notably, ILC phospho-Akt levels were comparable with 
those typically observed in the more aggressive HER2-F and 
ER-/basal-like breast tumors (Figure S4B), which have uniformly 
high levels of PI3K/Akt signaling (Cancer Genome Atlas, 2012). 
Consistent with these results, we found significant upregulation 
of a PI3K/Akt pathway-specific protein and phospho-protein 
expression signature (Akbani et al., 201 4) in LumA ILC compared 
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Figure 3. Recurrent FOXA1 Mutations Cluster in the 3D Space and Correlate with High FOXA1 Activity 

(A) Recurrent FOXA1 mutations in 817 breast tumors cluster in the Fork-head DNA binding (FK) domain and in the C terminus trans-activation (TA) domain. 

(B) Secondary structure elements of the FK domain are not equally mutated. FOXAl mutations cluster in the W2 loop and rarely target residues interacting with 
the DNA. 

(C) Residue-residue minimum distances for all residues in the FK domain using the 3D structure of FOXA3 FK domain (PDB ID: 1VTN). Frequently mutated 
residues 1176 and D226 are close in the 3D space (but not in the sequence) to the residues in the W2 loop. See also Figure S3C. 

(D) 3D structure of the FK domain. Mutations in the W2 loop, in 1176, and in D226 form a mutational structural hotspot (MSH). 

(E) 3D structure of FK domain bound to the DNA molecule shows mutated residues (red) are not those in contact with the DNA (light blue). 

(F) Across all breast cancer subtypes (histopathology and PAM50), FOXAl mutations are associated with FOXAl high mRNA expression. FOXAl mRNA 
expression is highly correlated with ER mRNA expression [log 2 (RSEM)] and anti-correlated with DNA methylation at FOXAl binding sites consistent with FOXAl 
activity. DNA methylation of randomly selected probes was used as control. 



to LumA IDC (Figure 4B, Tables S1 and S6). Based on this signa- 
ture, we found nearly equivalent levels of PI3K/Akt signaling in 
LumA ILC and basal-like and HER2-I- IDC (Figure S4C). Finally, 
PARADIGM analyses (Vaske et al., 2010) showed increased 
activation of Akt signaling in LumA ILC relative to LumA IDC 
(Figure 4D). 

PTEN protein loss and increased Akt phosphorylation were 
observed in association with PTEN genetic alterations, as well 
as in multiple ILC PTEN wild-type cases indicating that additional 
mechanisms contribute to the activation of the pathway. 
While PIK3CA mutations were frequent in LumA ILC tumors, 
these mutations were not associated with increased levels of 
phospho-Akt or pathway activity in our dataset. Using MEMo 



(Ciriello et al., 2012), we highlighted multiple genetic alterations 
converging on Akt/mTOR signaling in 45% of the samples (Fig- 
ure S4D, Table S7). Among these, alterations acting upstream 
of Akt were identified in 40% of ILC cases and were associated 
with increased Akt phosphorylation and PI3K/Akt score (Fig- 
ure 4E), providing an apparent molecular explanation for Akt 
activation in these samples. Interestingly, these events included 
ERBB2 amplification and mutations (Figure 4E), both of which 
have been identified in relapsed ILC (Ross et al., 2013). 

ILC mRNA Subtypes 

Using mRNA-seq expression data from LumA ILC samples 
(n = 106), we identified three ILC subtypes termed reactive-like, 



512 Cell 163, 506-519, October 8, 2015 ©2015 Elsevier Inc. 






Cell 



A 



B 



PI3K.p110.alpha 

PI3K.p85 

PTEN 

INPP4B 

Akt 

Akt pS473 
Akt pT308 
EGFR 
EGFR pY1068 
EGFR pY1173 
STAT3 pY705 

mTOR pS2448 
p70S6K pT389 

PI3K/AKT Score 




PIK3CA ij> infill I I III l■■llllll l|l llllllllii||H lllll mil II I 
PTEN I I II I l| I I 



D 



o- 

F0X01 



Active family 



Phospho-Forkhead box 
transcription factor 



T 

( NR4A1 ) 



ospho-GSI 



I Higher in Lum A Ductal 
I Higher in Lum A Lobular 



^Activation O 

I I Family 



ooO 



Altered in MEMo module (n=51) 



Wild type in MEMo module (n=76) 



PI3K/Akt score 
pAKT 



I Ilf 



ERBB2 |||||||||> 

AKT1 
PTEN 
GAB2 II 
IKBKB 
MIR21 I II 
KRAS 
NF1 

PIK3CA 



Illlllll"' 



I I I 



III! 



Hill 



• vs. • 
p = 0.007 
p = 0.035 

10% 

2% 



6% 

2% 



■mil II I III lllmllllll■lll■l■lmllllllll■lllll■ 



Protein expression 




Signature Score 




Statistical Analyses 

* Higher in ILC * p<0.05 

* Lower in ILC ** p<0.005 

*** p<0.0005 

Genetic Alterations 

■ Truncating mutation ■ Missense mutation 

■ Hotspot mutation ■ In-frame del 

I High level I High level 

copy number loss I copy number gain 



Figure 4. Akt Signaling Is Highest in ILC Tumors 

(A) Differential protein and phospho-protein analysis between ILC LumAand IDC LumA reveals significant lower levels of PTEN, and higher levels of Akt, phospho- 
Akt, EGFR, phospho-EGFR, phopsho-STATS, and phospho-p70S6K in ILC LumA. 

(B) A PI3K/Akt protein expression signature is significantly upregulated in ILC tumors. See also Figures S4B-S4C. 

(C) Mutation and copy-number alterations in PIK3CA and PTEN 

(D) PARADIGM identifies increased Akt activity in LumA ILC tumors. 

(E) MEMo identified multiple mutually exclusive alterations in ILC converging on Akt signaling and associated with increased phospho-Akt and PI3K/Akt protein 
signature in these tumors. Hotspot are defined as follow: PIK3CA E542, E545, Q546, and H1047; ERBB2 L755, I767, V777; AKT1 E17; KRAS G12. 



immune-related, and proliferative (Figures S5A-S5I, Table S8). 
We then used a 3-class ILC subtype classifier (60 genes, Table 
S13) to score all ILC samples in the TCGA (n = 127) (Figure 5A) 
and METABRIC (Curtis et al., 2012) datasets (Table SI 2). Our 
analyses identified many significant genomic features that distin- 
guished each ILC subtype at the mRNA and protein/phospho- 
protein level; but no distinguishing somatic mutations or DNA 
copy-number alterations. 

Significant analysis of microarray (SAM) analysis (Tusher et al., 
2001) identified 1,277 genes differentially expressed between 
ILC subtypes (q = 0) (Figure 5A, Table S8). Of these, 1,005 
were highly expressed in reactive- 1 ike tumors, which had lower 
tumor purity as determined by ABSOLUTE (Carter et al., 2012) 
(Figures 5A and S5P), and included genes consistent with 
epithelial and stromal-associated signaling including keratin, 
kallikrein, and claudin genes as well as the oncogenes EGFR, 
MET, PDGFRA, and KIT (Table S8). The remaining 272 genes 
were highly expressed in immune-related tumors and include 
modulators of immunogenic signaling such as interleukins (IL), 
chemokine receptors and ligands, major histocompatibility com- 
plex, and tumor necrosis factors, as well as ID01 and IFNG 
(Figure 5A and Table S8). Interestingly, immune activity in this 
subset of tumors appears to be predominantly associated with 



macrophage-associated signaling as increased levels of CD68 
(p < 0.05), macrophage-associated colony stimulating factor 
(MacCSF), macrophage-associated TH1 (MacTHI), and T cell 
receptor (TCR) gene expression signatures (Iglesia et al., 2014) 
were observed in both the TCGA (Figure 5A) and METABRIC 
(Figure S5J-S5K) datasets. Finally, proliferative tumors were 
defined by low expression of each of these 1,277 genes 
(Figure 5A). Intriguingly, in each dataset (Figures 5 and S5K) 
proliferative tumors had higher levels of proliferation relative 
to reactive-like tumors (TCGA: p = 3.3E-09; METABRIC: p = 
0.018) and slightly higher or equivalent levels compared to 
immune-related ones (TGCA: p = 0.29; METABRIC: p = 0.008). 
Regardless of ILC subtype, ILC tumor proliferation was generally 
lower than all IDC subtypes (Figures S5L-S5M). 

With respect to previously reported RPPA-based subtypes 
(reactive or non-reactive), reactive-like ILC largely, but not 
entirely, comprised tumors classified as reactive (Figure 5B; 
p < 1 E-4), a subgroup characterized by strong microenvironment 
and/or cancer fibroblast signaling (Cancer Genome Atlas, 201 2). 
Examining protein and phospho-protein expression differences 
between ILC subtypes identified many significant features 
(Figure 5B and Table S6). Reactive-like tumors had higher 
levels of c-Kit (p = 4E-4), consistent with mRNA expression, total 
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(p = 0.004) and phosphorylated PKC alpha (S657, p = 0.002); 
beta catenin (p = 0.012) and E-cadherin (p = 0.011), although 
both beta-catenin and E-cadherin levels are significantly lower 
than in all IDC subtypes. Decreased levels were observed 
instead for p70S6 kinase (p = 0.017), Raptor (p = 0.027) and 
elF4G (p = 0.024). 

Immune-related tumors had higher levels of immune modulator 
STATS alpha (p = 0.019), PI3K/Akt targets phospho-PRAS40 
(T246, p = 0.016) and mTOR (S2448, p = 0.019), and total (p = 
0.004) and phospho-MEKI (S217-S221, p = 0.022). Consistent 
with the mRNA proliferation signature, tumors in the proliferative 
subtype have increased expression of cell-cycle proteins cyclin 
El (p = 0.036), FoxMl (p = 0.019), PCNA (p = 0.019), and prop- 
sho-Chkl (S345, p = 0.038) as well as DNA repair components 
RadSO (p = 0.007), RadSI (p = 0.007), XRCC1 (p = 0.028), and 
BRCA2 (p = 0.038). Decreased expression was observed for 
total (p = 0.014) and phospho-MAPK (T202-Y204, p = 0.038), 
and phosphorylated MEK1 (S217-S221, p = 0.019), PKC alpha 
(S657, p = 0.006), PKC beta (S660, p = 0.037), and Src (Y527, 
p = 0.026). Protein pathway signatures (Akbani et al., 2014) 
recapitulated these findings with proliferative tumors having 
increased levels of the cell cycle (p = 0.005) and DNA damage 
response (p = 0.014) signatures and a lower RAS-MAPK signa- 
ture (p = 0.031) score (Tables SI and S6). 

Using an integrative genomics approach, PARADIGM pre- 
dicted increased activation of the TP53, TP63, TP73 TCF/beta- 
catenin PKC, and JUN/FOS pathways in reactive-like tumors; 
increased activation of immune-modulators IL12 and IL23, 
IL12R and IL23R, JAK2 and TYK2 in immune- related ILCs 
(Baay et al., 2011; Duvallet et al., 2011; StrobI et al., 2011), 
and decreased activation of each of these pathways along with 
lower levels of MAPK3, RBI , and ERK1 (Figure 5C) \n proliferative 
ILC tumors. 

Lastly, we determined that reactive-like ILC patients had a 
significantly better disease-specific (DSS) (p = 0.038, FIR: 0.47) 
and overall survival (OS) (p = 0.023, FIR: 0.50) compared to 
proliferative ILC patients in the METABRIC dataset, which has 
a median follow-up of 7.2 years (compared to the TOGA median 
follow-up of less than 2 years), (Figure 5D). Consistent with 
these results, patients with more proliferative lobular tumors 
(i.e., greater than the median PAM50 proliferation signature 
score) had worse DSS (p = 0.025, HR: 2.0) and a tendency 
toward worse OS (p = 0.058, HR: 0.63) compared to patients 
with a lower proliferation score (Figures S5N-S50). No sig- 
nificant differences in DSS or OS were identified between 
the immune-related subgroup and either the proliferative or 
reactive-like subgroup. These results are consistent with previ- 
ous studies reporting that the reactive stromal phenotype is 
associated with a good prognosis in breast cancer while prolifer- 
ation is one of the strongest indicators of worse outcome in 
luminal/ER+ breast cancers (Ciriello et al., 2013b). 

Tumors with Mixed ILC and IDC Histology 

Histologically, ~3%-6% of breast tumors present both a ductal 
and a lobular component (Figure 6A). Pathologists currently clas- 
sify these tumors as mixed ductalliobuiar breast carcinoma or 
invasive ductal cancers with lobular features (Arps et al., 2013). 
There are, however, no defined criteria or uniform terminology 



for the classification of mixed tumors and as a consequence 
discordant clinical and molecular features have been reported 
(Bharat et al., 2009). Molecular profiling has the power to provide 
quantitative endpoints to compare the genetics of mixed tumors 
with those of pure ILC and IDC. In our dataset, 88/817 tumors 
(11%) were classified by as mixed ductal/lobular breast carci- 
nomas. We characterized these mixed tumors using multiple 
computational approaches integrating different data-types 
thus to determine whether they molecularly resembled IDC 
(IDC-like), ILC (ILC-like), or neither. 

We first analyzed the transcriptional landscape of mixed 
tumors using the ISOpure algorithm (Quon et al., 2013), which 
deconvolves the transcriptional signal of each queried tumor to 
estimate how much of it can be explained by one or more refer- 
ence populations and how much is unique. Interestingly, mRNA 
expression profiles of all mixed cases could almost completely 
be explained by either IDC or ILC reference populations, sug- 
gesting that these tumors separate into IDC-like and ILC-like 
cases and do not represent a molecularly distinct subtype (Fig- 
ure S6A). Based on this analysis, 32/88 mixed cases received 
an ILC-score greater than the IDC-score, and were therefore 
classified as ILC-like (Figure 6B). 

We next evaluated the resemblance of mixed tumors to IDC 
and ILC based on the previously determined selected set 
of copy-number alterations (CNAs) and mutations (Table S2). 
Mixed tumors were enriched for IDC recurrent CNAs and muta- 
tions when compared to ILC, and vice versa (Figure S6B), 
indicating ILC and IDC genetic alterations were both present in 
these tumors, either simultaneously or in separate IDC-like and 
ILC-like subgroups. We then compared each mixed tumor to 
ILC and IDC based on their genomic features, by adapting the 
OncoSign algorithm (Ciriello et al., 2013a). This approach identi- 
fied 19 ILC-like mixed samples characterized by ILC genetic 
features (Figures 6B and S6D). All CDH7 -mutated mixed cases 
were classified as ILC-like, indicating CDH1 status as a domi- 
nant feature in this analysis. A few CDH1 wild-type mixed cases 
were also classified as ILC-like and characterized by ILC-en- 
riched events such as mutations in RUNX1 (3/4 mutated cases), 
TBX3 (2/4), and FOXA1 (2/6). ILC-enriched alterations did not 
co-occur with IDC-enriched ones, further indicating that mixed 
tumors can be categorized into ILC-like or IDC-like subgroups 
and do not constitute a molecularly distinct subtype. 

Finally, we combined 428 CNA, including focal and arm-level 
alterations, 409 gene expression modules (Fan et al., 2011; 
Gatza et al., 201 4) and somatic mutations for 1 28 genes mutated 
in more than 3% of the cases into a single ElasticNet classifier 
(Zou and Hastie, 2005). This integrated ElasticNet predictor 
identified 27/88 mixed tumors as ILC-like. These were enriched 
for the LumA subtype, CDH1 mutations and loss of E-cadherin 
mRNA expression (Figures 6B and S6E). 

Overall, these approaches were highly concordant (Figures 6B 
and S6F) with 24/88 cases (18/57 LumA cases) being called 
ILC-like by at least two approaches, and 64 being called IDC- 
like (Table SI). ILC-like and IDC-like mixed tumors when 
compared to pure ILC and IDC, respectively, do not show signif- 
icant enrichment for specific genomic alterations, being mole- 
cularly similar to either one or the other subtype. Our analyses 
demonstrate that mixed histology tumors overwhelmingly tend 
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to resemble either ILC or IDC as opposed to representing a third 
distinct group. Moreover, IDC and ILC discriminant molecular 
features, in particular CDH1 status, could be used to stratify 
mixed tumors into ILC-like and IDC-like tumor subgroups. 

DISCUSSION 

In this study we provide the most comprehensive molecular 
portrait to date of ILC. E-cadherin loss was confirmed the ILC 
hallmark lesion, and we could identify CDH1 loss at the DNA, 
mRNA, and protein level in almost all ILC cases. Moreover, 12/ 
27 CDH1 mutations in non-ILC cases occurred in mixed tumors 
strongly resembling ILC at the molecular level. Surprisingly, we 
did not identify DNA hyper-methylation of the CDH1 promoter 
in any breast tumor, suggesting that E-cadherin loss is not epige- 
netically driven. In addition, ILC and IDC differed in the FOXA1 
and GATA3 mutational spectra, PTEN loss, and Akt activation. 
The lower incidence of GATA3 mutations in ILC and lower 
GATA3 mRNA and protein expression suggest that in LumA 
ILC tumors there is a preferential occupancy of ER in FOXA1 
bounded sites (Theodorou et al., 2013). Differential ER activity 
is also observed at the protein level where both total ER 
(p = 0.005) and phospho-ER (p = 2E-05) levels are reduced in 
LumA ILC versus LumA IDC. These findings in the context of 
recent data suggesting an improved response to the aromatase 
inhibitor letrozole as compared to tamoxifen in ILC (Metzger 
et al., 2012; Sikora et al., 2014) warrants further investigation. 

The chromatin remodeling factor EP300, also involved in ER 
modulation, is able to directly acetylate FOXA1, and EP300 
driven acetylation prevents FOXA1 DNA binding, but does not 
affect the protein when already bound (Kohler and Cirillo, 
2010). Intriguingly, five acetylation sites have been identified in 
the wings of the fork-head domain; three of them in W2 (K264, 
K267, and K270), where most of our newly observed FOXA1 mu- 
tations cluster. These observations lead to the hypothesis that 
FOXA1 mutations could alter EP300 dependent acetylation of 
FOXA1 without affecting EP300 modulation of ER. While a 
rigorous evaluation of the role of EP300 in breast cancer and 
how FOXA1 mutations interfere with it goes beyond the scope 
of this study, FOXA1 mutations, its correlation with FOXA1 
expression and lack of DNA methylation at its binding sites, 
and exclusivity with GATA3 mutations support these as events 
activating FOXA1 function and, thus, ER transcriptional program. 

The PI3K/Akt pathway is among the most altered in cancer 
providing tumor cells with enhanced growth and survival capa- 



bilities. Integrating protein and phospho-protein data with gene 
expression and pathway activity signatures, we consistently 
identified increased Akt signaling in ILC versus IDC. Notably, 
E-cadherin loss has been associated with Akt activation and 
EGFR overexpression (Lau et al., 2011; Liu et al., 2013). Lack 
of E-cadherin expression, which characterizes almost all ILC 
tumors, may thus provide a favorable cellular context for Akt 
activation. Recently, PI3K and Akt inhibitors entered clinical 
trials for several cancer types including breast cancer. Flere 
we showed that ILC has on average the highest levels of Akt 
activation, measured by phospho-Akt and PI3K/Akt signaling 
among all breast cancer subtypes (comparable to IDC basal- 
like), making selective inhibition of this pathway in ILC a particu- 
larly attractive strategy. 

Unbiased characterization of the ILC transcriptome showed 
a high degree of internal variability giving rise to three main 
subgroups: reactive-iike, immune-reiated, and proiiferative. 
While additional validation studies will clearly be required, we 
do observe increased expression of many druggable path- 
ways/targets including increased levels of phospho-mTOR and 
phospho-MEKI expression in the immune-reiated subgroup 
as well as increased SMO and ERK pathway activity in the 
reactive-iike subgroup. These results, coupled with difference 
in clinical outcome, suggest that these subgroups will be impor- 
tant for future studies focused on both the clinical and biological 
aspects of ILC. 

Finally, we showed that mixed ILC/IDC tumors could be 
separated into two major groups based on their molecular resem- 
blance to either ILC (ILC-like) or IDC (IDC-like). The ability to clas- 
sify cancers with mixed phenotypes based on the underlying 
biology has implications for clinical practice as well as furthering 
our understanding of the etiology of such lesions. Indeed, ILC 
carcinomas often metastasize to body sites not colonized by 
IDCs (e.g., gastrointestinal [Gl] tract and peritoneal surfaces). 
ILC are also typically of low histologic grade and with low to inter- 
mediate mitotic index, thus limiting their response to primary 
chemotherapy (Cristofanilli et al., 2005) and their ability to be de- 
tected on PET scans. As such, clinicians must be aware of non- 
specific symptomatology and favor diagnostic approaches 
such as anatomical scanning (CT scan) for ILCs. Finally, the iden- 
tification of ILC enriched molecular features may ultimately lead 
to the design of ILC-targeted therapies. A more refined classifica- 
tion of mixed cancers as IDC-like or ILC-like will improve our un- 
derstanding, detection, and follow-up of the disease, and enable 
a more informed and targeted treatment selection. 



Figure 5. ILC Molecular Subtypes 

(A) Three molecular subtype of lobular breast cancer were identified based on differentiai gene expression and show unique patterns highiy expressed genes 
(n = 1277, SAM FDR = 0, upper panei), minor difference in tumor purity measured by ABSOLUTE, and differences in gene expression signatures measuring 
proiiferation, CD68, Macrophage-associated CSF1, Macrophage-associated TH1, and T Ceii Receptor Signaiing (iower panei). Proiiferation is highest in the 
proliferative (Pro) and immune-related (iR) subgroups; macrophage associated signaiing is highest in immune-related tumors. 

(B) Differences in protein expression profiies as determined by RPPA anaiysis. The reactive-iike (RL) subgroup shows a significant association (p < 1 E-4, Fisher’s 
Exact test) with the RPPA-defined reactive subgroup of breast cancer. Differences in subgroup-specific patterns of protein expression (p < 0.05, t test) for 
individuai proteins (upper panei) as weii as for protein expression signatures (iower panei) were identified. The proiiferative subgroup shows higher expression of 
the ceii-cycie and DNA damage response pathways and iower ieveis of Ras-MAPK signaiing (p < 0.05). 

(C) Subgroup-associated signaiing features identified by PARADiGM. 

(D and E) Reactive-iike (n = 55) tumors have significantiy better (D) disease specific (p = 0.038, HR: 0.47, iog-rank) and (E) overaii survivai (p = 0.022, HR = 0.50) 
compared to proliferative (n = 44) tumors in the METABRiC cohort. 

See aiso Figure S5. 
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Figure 6. Molecular Classification of Mixed 
Ductal/Lobular Carcinoma 

(A) Mixed ductal/lobular tumors present at the 
same time both a iobuiar and ductai component. 

(B) We used three aigorithmic approaches 
(EiasticNet, OncoSign, and ISOpure) to evaluate 
the resemblance of mixed tumors to either ILC 
(ILC-like) or IDC (IDC-like) based on molecular 
features. ILC-IDC scores are shown for all three 
approached at the top. See also Figures S6B-S6D. 

(C) Genetic alterations enriched in ILC tumors 
are frequently found in ILC-like mixed cases 
(in particular CDH1 mutations), whereas those 
enriched in IDC are more frequent in IDC-like 
mixed cases. 
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(D) ILC-like mixed-cases are characterized by 
both low E-cadherin mRNA and protein level. 
See also Figure S6E. 



This multi-platform study identified numerous molecular fea- 
tures discriminating between breast ILC and IDC, demonstrating 
different pathways underlying their pathogenesis, defining new 
ILC subtypes with different clinical outcomes, and pointing to 
previously unrecognized therapeutic possibilities. Importantly, 
we provided here a curated and integrated dataset for 817 
breast tumors, including the largest collection to date of com- 
prehensively profiled ILC. To facilitate the exploration of this 
dataset, we created a public web-service (http://cbio.mskcc. 
org/cancergenomics/tcga/brca_tcga) organizing all analyses 
and data used in this manuscript. We believe this resource will 
serve as a reference for many to further advance our understand- 
ing of human breast cancers. 

EXPERIMENTAL PROCEDURES 

Tumor and matched normal specimen were collected as previously described 
(Cancer Genome Atlas, 2012). In total 81 7 primary tumor samples were assayed 
by whole-exome DNA sequencing, RNA sequencing, mlRNA sequencing, SNP 
arrays, and DNA methylation arrays. A subset of 633 samples was assayed by 
reverse phase protein array (RPPA). Histological subtypes have been deter- 
mined based on consensus by a pathology committee. Intrinsic breast cancer 
subtyping was performed on all 817 cases, using the PAM50 classifier (Parker 
et al., 2009). Data generation and processing were performed as previously 
described (Cancer Genome Atlas Research Network, 2014). 

Enrichment analyses for selected events were performed using Fisher’s 
exact tests and a binary representation of copy-number alterations and muta- 
tions (1 is altered, 0 is wild-type). 

DNA methylation of the CDH1 promoter was assessed at probes within a 
window 1,500 bp upstream and downstream CDH1 transcription start site 
using both HM27 and HM450 data. Whole-genome bisulfite sequencing was 
performed to characterize DNA methylation levels at 157 CpGs. 

Distances between of FOXA1 mutations have been determined from the 
tertiary structure of FOXA3 fork-head domain (PDB ID: 1 VTN). Predicted DNA 
interactions were derived by WebPDA (http://bioinfozen.uncc.edu/webpda). 
Differential expression analyses on RNA-seq data were performed using the 
limmavoom package (Law et al., 2014). 

Replication Based Normalized (RBN) RPPA data containing expression 
levels for 187 protein and phosphorylated proteins for 633 samples were 
used for protein differential expression analysis. Differential pathway activity 
was assessed by t test. 

ILC subtypes were determined using Consensus Cluster Plus Analysis 
(Wilkerson and Hayes, 2010) based on the 1 ,000 most differentially expressed 
genes and a classifier was built using ClaNC (Dabney, 2006). 



Detailed description of each analysis presented in this study can be found 
within the Supplemental Experimental Procedures. 
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Ligand-Dependent Enhancer Activation Reguiated 
by Topoisomerase-I Activity 

Janusz Puc, Piotr Kozbial, Wenbo Li, Yuliang Tan, Zhijie Liu, Tom Suter, Kenneth A. Ohgi, Jie Zhang, Aneel K. Aggarwal, 
and Michael G. Rosenfeld* 

*Correspondence: mrosenfeld@ucsd.edu 
http://dx.d 0 i. 0 rg/l 0.101 6/j.cell.201 5.08.044 

(Cell 160 , 367-380; January 29, 2015) 

Our paper studied the role of the DNA nickase TOPI in enhancer activation and transcription of eRNA in androgen receptor 
(AR)-regulated transcriptional programs in prostate cancer cells. The central findings of the paper were that TOPI is recruited to 
AR-bound active enhancers, along with elements of the DNA damage repair machinery, and that DNA nicking activity of TOPI is 
required for robust eRNA synthesis and enhancer activation. 

It has come to our attention that, in Figures 5 and S5A, showing the kinetics of recruitment of several members of the DNA damage 
repair machinery to the KLK3, KLK2, TMPRSS2, and NDRG1 enhancer regions, the presented panels showing the results of the 
qPCR of the Ku80 and p-S1983-ATM ChIPs were identical. We retrieved the original files used to prepare the figures in the paper 
and were able to verify that the values depicted in the p-SI 983-ATM graphs were incorrect. We reconstructed all steps of the prep- 
aration of the published figures and concluded that the errors originated when data were compiled using GraphPad software and we 
mistakenly copied the Ku80 values into p-SI 983-ATM plots. Using the raw data from the original experiments, we were able to plot 
the correct numbers and we now provide corrected versions of Figures 5 and S5 below. The figures have also been corrected online. 
The correct data support the original interpretation that pATM is maximally recruited to enhancers ~60 min after dihydrotestoster- 
one-induced binding of androgen receptor, and the conclusions of the paper remain unchanged. We apologize for not identifying 
these errors before publication and for any inconvenience that they may have caused. 
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Figure 5. Canonical DNA Damage/Repair Machinery Components Recruit to AR-Regulated Enhancers 
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Figure S5. Kinetic ChIP Studies Performed on Additional AR Enhancers, Related to Figure 5 
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Pathogen Cell-to-Cell Variability Drives 
Heterogeneity in Host Immune Responses 

Roi Avraham, Nathan Haseley, Douglas Brown, Cristina Penaranda, Humberto B. Jijon, John J. Trombetta, Rahul Satija, 
Alex K. Shalek, Ramnik J. Xavier, Aviv Regev, and Deborah T. Hung* 

*Correspondence: hung@molbio.mgh.harvard.edu 
http://dx.d 0 i. 0 rg/l 0.1 01 6/j.cell.201 5.09.044 

(Cell 162 , 1309-1321; September 10, 2015) 

In the above article, the authors inadvertently included an incorrect GEO accession number, GSE66528. The article has been 
corrected online to reflect the correct accession number, GEO: GSE65528. 
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Snapshot: Olfactory Classical 
Conditioning of Drosophila 
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Classical conditioning with odor cues occurs when a specific odor conditioned stimulus (CS) is paired with an aversive unconditioned stimulus (US) such as electric shock or an appetitive US such 
as a food reward (Tomchik and Davis, 2013). This conditioning produces avoidance behavior to odors associated with an aversive US and approach behavior to those associated with an appetitive 
US. The memory components from such conditioning include acquisition; short-, intermediate- (consolidation), and long-term memory; active forgetting; and memory expression. Many different 
types of neurons within the olfactory nervous system— brain structures primarily involved in the processing of odorant cues— contribute to olfactory memory formation. 

1. Conditioned Stimulus Pathway: The sensation and perception of specific odors (yellow circles) occurs from the activation of olfactory receptor neurons (ORn) in the antenna. Odor 
identity is established by the activation of odor-specific sets of projection neurons (Pn, red) located in the antennal lobe (AT, turquoise), the equivalent of the olfactory bulb in vertebrates (Masse 
et al., 2009). The Pn transmit odor identity to synapses (red branches) with the mushroom body neurons (MBn, blue circles) in the dorsal posterior brain. The MBn fall into three major classes, 
the a/p, aVP', and y MBn (see 2). Their axons project through neuropil known as the horizontal (H) and vertical (V) MB lobes (blue), named according to the class of axons they contain. MBn are 
thus third order in the olfactory pathway (Tomchik and Davis, 2013). 

2. Unconditioned Stimulus Pathway: Accumulated evidence indicates that dopaminergic neurons (DAn) are necessary and sufficient to serve as the US signal (Waddell, 2013). Two 
clusters of DAn (PPLl and PAM) serve this purpose and innervate segments of the horizontal and vertical lobes. Those DAn providing aversive and appetitive US are colored red and green, 
respectively, along with the MB lobe regions that they innervate. 

3. Acquisition: Acquisition occurs, in part, through the temporal coincidence of the odor CS and the aversive or appetitive US. MBn are intimately involved in acquisition. Informa- 
tion about the odor CS conveyed by Pn is transmitted to the MBn by the activation of MBn-expressed ACh receptors. This leads to an increase in intracellular calcium concentration. The US 
information is transmitted from DAn to MBn using the dopamine receptor, dDAl. Increased calcium, along with the activation of the dDAl receptor on the MBn, produces a synergistic increase 
in cAMP concentration. The mobilization of the cAMP signaling system is essential for acquisition. Many different molecules are required for acquisition, including the relevant receptors, 
calmodulin-activated adenylyl cyclase, neurofibromin, and others. GABAergic input to receptors expressed by MBn from the APLn constrains acquisition (Tomchik and Davis, 2013). 

4. Short-Term Memory (STM): Short-term memory is brought about, in part, through plastic changes in the MBn and other cell types due to the transient increase in cAMP level. Genet- 
ics and molecular biology have identified many different molecules required for STM, including various synaptic proteins, ion channels, cAMP signaling molecules, and others. A memory trace 
corresponding to STM, observed as an increased calcium response to the GS-i- odor after aversive conditioning, is observed in the axons of the a7(3' MBn (Tomchik and Davis, 2013; Davis, 2011). 

5. Intermediate-Term Memory (ITM)/Consolidation: Gonsolidation into protein-synthesis-dependent LTM occurs across the first few hours after conditioning, making it difficult to 
distinguish effects of mutations on a specific temporal form of memory— ITM— or the process of consolidation. ITM/consolidation requires the reciprocal interactions between dorsal paired 
medial neurons (DPMn) and the MBn. The amnesiac gene product, expressed by the DPMn, is required for the expression of protein-synthesis-dependent LTM. A memory trace corresponding 
to ITM is observed in the processes of the DPMn innervating the vertical MB lobes (Tomchik and Davis, 2013; Davis, 2011). 

6. Long-Term Memory (LTM): Protein-synthesis-dependent LTM requires the actions of the transcription factor, Greb, in the MBn along with genetic functions for local translation 
(staufen, pumilio, oskar, Gpeb) and a host of other genetic functions specifically involved in LTM. In addition to the MBn, the dorsal anterior lateral neurons (DALn, not shown) are required for 
LTM. A memory trace corresponding to LTM is observed in the vertical axons of the a/(3 MBn (Dubnau et al., 2003; Davis, 2011). 

7. Active Forgetting: One mechanism for active forgetting involves release of dopamine (DA) from a small group of DAn (red) clustered near the MB calyx (G). These DAn innervate the 
MBn axons in an area joining the vertical and horizontal lobes. DA activates a receptor named DAMB that initiates the forgetting of memory traces stored in the MBn. This DA signal is sup- 
pressed by the state of sleep and is activated by sensory stimuli (Berry et al., 2015). The intracellular signaling for this receptor is at present unclear but may involve the downstream activation of 
the small G protein, Rac (Shuai et al., 2010). 

8. Memory Expression: Synaptic output from a small group of glutamatergic neurons (MBOn-M4/6, green) that project dendrites into the distal tips of the horizontal lobes is required 
for the expression of both aversive and appetitive ITM and appetitive LTM. These neurons project their axons to a terminal field residing between the vertical and horizontal lobes. A memory 
trace, observed as an increased calcium response to the GS-i- odor after aversive conditioning and a decreased response to the GS-i- odor after appetitive conditioning, is observed in the dendrites 
of these neurons (Owald et al., 2015). Synaptic output from a small group of cholinergic neurons (MBOn-V2a, orange) that project dendrites into a defined segment of the vertical lobe is also 
required for the expression of aversive ITM and LTM (Sejourne et al., 2011). These neurons project their axons to lateral regions of the brain. A memory trace, observed as a decreased calcium 
response to the GS-i- odor after aversive conditioning, is observed in the dendrites of these neurons. The relative activity of these and other MBOn with dendritic arbors in other segments of the 
MB lobes is thought to capture and integrate the activity of MBn to help drive avoidance or approach behavior (Aso et ah, 2014). 

ABBREVIATIONS 

GS, conditioned stimulus; US, unconditioned stimulus; ORn, olfactory receptor neuron; Pn, projection neuron; MBn, mushroom body neuron; DA, dopamine; DAn, dopamine neuron; AGh, acetyl- 
choline; APLn, anterior paired lateral neuron; STM, short-term memory; ITM, intermediate-term memory; AN, antennal nerve; LTM, long-term memory; DPMn, dorsal paired medial neuron; DALn, 
dorsal anterior lateral neuron; G, mushroom body calyx; MBOn, mushroom body output neuron. 
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